Incorporation of ordinal optimization into learning automata for high learning efficiency
Document Type
Conference Proceeding
Publication Date
10-7-2015
Abstract
Learning automata (LA) represent important learning mechanisms with applications in automated system design, biological system modeling, computer vision, and transportation. They play critical roles in modeling a process as well as in generating the appropriate signal to control it. They update their action probabilities in accordance with the inputs received from the environment and can improve their own performance during operation. The action probability vector in LA serves two functions: 1) determining the cost of convergence, i.e., the size of the sampling budget; and 2) allocating the sampling budget among actions to identify the optimal one. These two intertwined functions lead to a problem: the sampling budget mostly goes to the currently estimated optimal action because of its high action probability, regardless of whether this helps identify the true optimal action. This work proposes a new class of LA that separates the allocation of the sampling budget from the action probability vector. It uses the action probability vector to determine the size of the sampling budget and then uses Optimal Computing Budget Allocation (OCBA) to allocate that budget in a way that maximizes the probability of identifying the true optimal action. Simulation results show a significant speedup, ranging from 10.93% to 65.94%, over the best existing LA algorithms.
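The budget-allocation step mentioned in the abstract can be illustrated with a minimal sketch of the standard OCBA allocation rule. This is not the authors' implementation: the abstract does not give the algorithm's details, so the function name `ocba_allocation`, its parameters, the assumption that the optimal action is the one with the highest estimated mean reward, and the `eps` guard are all illustrative assumptions.

```python
import math

def ocba_allocation(means, stds, budget, eps=1e-12):
    """Split `budget` samples across actions using the standard OCBA ratios.

    Hypothetical helper for illustration only. Assumes a higher mean is
    better and that every standard deviation is positive.
    """
    k = len(means)
    # Index of the currently estimated best action.
    b = max(range(k), key=lambda i: means[i])

    weights = [0.0] * k
    for i in range(k):
        if i != b:
            # Gap to the best action; eps avoids division by zero on ties.
            delta = max(means[b] - means[i], eps)
            # Non-best actions: weight proportional to (sigma_i / delta_i)^2.
            weights[i] = (stds[i] / delta) ** 2

    # Best action: sigma_b * sqrt(sum over i != b of (w_i / sigma_i)^2).
    weights[b] = stds[b] * math.sqrt(
        sum((weights[i] / stds[i]) ** 2 for i in range(k) if i != b)
    )

    total = sum(weights)
    # Fractional allocations; round as needed before sampling.
    return [budget * w / total for w in weights]
```

Calling `ocba_allocation([1.0, 0.5, 0.0], [1.0, 1.0, 1.0], 100)` gives the close competitor (mean 0.5) four times the samples of the distant one (mean 0.0), rather than concentrating the budget on the current best action alone.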
Identifier
84952787991 (Scopus)
ISBN
[9781467381833]
Publication Title
IEEE International Conference on Automation Science and Engineering
External Full Text Location
https://doi.org/10.1109/CoASE.2015.7294262
e-ISSN
21618089
ISSN
21618070
First Page
1206
Last Page
1211
Volume
2015-October
Recommended Citation
Zhang, Junqi; Wang, Cheng; Zang, Di; and Zhou, Mengchu, "Incorporation of ordinal optimization into learning automata for high learning efficiency" (2015). Faculty Publications. 6733.
https://digitalcommons.njit.edu/fac_pubs/6733
