Multiresolution state-space discretization for Q-Learning with pseudorandomized discretization

Cited by: 3
Authors
Lampton A. [1]
Valasek J. [2]
Kumar M. [3]
Affiliations
[1] Systems Technology, Inc., Hawthorne, CA 90250
[2] Department of Aerospace Engineering, Texas A&M University, College Station, TX 77843-3141
[3] Department of Mechanical and Aerospace Engineering, University of Florida, Gainesville, FL 32611-6250
Source
Journal of Control Theory and Applications | 2011, Vol. 9, No. 3
Funding
National Science Foundation (USA)
Keywords
Morphing; Random grid; Reinforcement learning
DOI
10.1007/s11768-011-1012-4
Abstract
A multiresolution state-space discretization method with pseudorandom gridding is developed for the episodic unsupervised learning method of Q-learning. It is used as the learning agent for closed-loop control of morphing or highly reconfigurable systems. This paper develops a method whereby a state-space is adaptively discretized by progressively finer pseudorandom grids around the regions of interest within the state or learning space in an effort to break the Curse of Dimensionality. Utility of the method is demonstrated with application to the problem of a morphing airfoil, which is simulated by a computationally intensive computational fluid dynamics model. By setting the multiresolution method to define the region of interest by the goal the agent seeks, it is shown that this method with the pseudorandom grid can learn a specific goal within ±0.001 while reducing the total number of state-action pairs needed to achieve this level of specificity to less than 3000. © 2011 South China University of Technology, Academy of Mathematics and Systems Science, Chinese Academy of Sciences and Springer-Verlag Berlin Heidelberg.
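The refinement idea in the abstract can be illustrated with a minimal one-dimensional sketch. It is not the authors' implementation: the function names, the 25% jitter used to make the grid pseudorandom, the stopping rule, and the 1-D state are all assumptions made here for illustration, and the Q-learning step that would run on each grid level is omitted. The sketch only shows how repeatedly re-gridding the cell that contains the goal reaches a fine tolerance with far fewer cells than one uniform grid at that resolution.

```python
import random


def pseudorandom_grid(lo, hi, n_cells, rng):
    """Cell boundaries on [lo, hi]; interior boundaries are randomly jittered."""
    width = (hi - lo) / n_cells
    edges = [lo]
    for i in range(1, n_cells):
        # Assumption: jitter each interior boundary by up to +/-25% of the
        # nominal cell width, which keeps the boundaries strictly increasing.
        edges.append(lo + i * width + rng.uniform(-0.25, 0.25) * width)
    edges.append(hi)
    return edges


def cell_containing(edges, x):
    """Return the (lower, upper) bounds of the cell that contains x."""
    for a, b in zip(edges, edges[1:]):
        if a <= x <= b:
            return a, b
    raise ValueError("x lies outside the grid")


def refine_around_goal(lo, hi, goal, n_cells, tol, seed=0):
    """Re-grid the cell containing the goal until it is within +/-tol wide."""
    if n_cells < 2:
        raise ValueError("n_cells must be at least 2 so each level shrinks")
    rng = random.Random(seed)  # seeded so the pseudorandom grids are repeatable
    levels = []
    while True:
        edges = pseudorandom_grid(lo, hi, n_cells, rng)
        levels.append(edges)
        # The region of interest for the next level is the goal's cell.
        lo, hi = cell_containing(edges, goal)
        if hi - lo <= 2 * tol:
            return levels


if __name__ == "__main__":
    levels = refine_around_goal(0.0, 1.0, goal=0.3712, n_cells=10, tol=0.001)
    total_cells = sum(len(edges) - 1 for edges in levels)
    print("levels:", len(levels), "total cells:", total_cells)
```

In this toy setting the cell count grows with the number of refinement levels rather than with the inverse of the tolerance, which is the effect the abstract describes when it reports fewer than 3000 state-action pairs for a ±0.001 goal.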
Pages: 431-439
Page count: 8