Recent advances in hierarchical reinforcement learning

Cited by: 3

Authors:

Barto, AG [1]

Mahadevan, S [1]

Affiliations:

[1] Univ Massachusetts, Dept Comp Sci, Autonomous Learning Lab, Amherst, MA 01003 USA

Source:

DISCRETE EVENT DYNAMIC SYSTEMS-THEORY AND APPLICATIONS | 2003 / Volume 13 / Issue 1-2

Funding:

National Science Foundation (USA);

DOI:

10.1023/A:1022140919877

Chinese Library Classification number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

Reinforcement learning is bedeviled by the curse of dimensionality: the number of parameters to be learned grows exponentially with the size of any compact encoding of a state. Recent attempts to combat the curse of dimensionality have turned to principled ways of exploiting temporal abstraction, where decisions are not required at each step, but rather invoke the execution of temporally-extended activities which follow their own policies until termination. This leads naturally to hierarchical control architectures and associated learning algorithms. We review several approaches to temporal abstraction and hierarchical organization that machine learning researchers have recently developed. Common to these approaches is a reliance on the theory of semi-Markov decision processes, which we emphasize in our review. We then discuss extensions of these ideas to concurrent activities, multiagent coordination, and hierarchical memory for addressing partial observability. Concluding remarks address open challenges facing the further development of reinforcement learning in a hierarchical setting.

Pages: 41 - 77

Number of pages: 37

50 items in total

[21] A Survey on Recent Advances and Challenges in Reinforcement Learning Methods for Task-oriented Dialogue Policy Learning
Kwan, Wai-Chung
Wang, Hong-Ru
Wang, Hui-Min
Wong, Kam-Fai
MACHINE INTELLIGENCE RESEARCH, 2023, 20 (03) : 318 - 334
[22] A Survey on Recent Advances and Challenges in Reinforcement Learning Methods for Task-oriented Dialogue Policy Learning
Wai-Chung Kwan
Hong-Ru Wang
Hui-Min Wang
Kam-Fai Wong
Machine Intelligence Research, 2023, 20 : 318 - 334
[23] Recent advances in reinforcement learning-based autonomous driving behavior planning: A survey
Wu, Jingda
Huang, Chao
Huang, Hailong
Lv, Chen
Wang, Yuntong
Wang, Fei-Yue
TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2024, 164
[24] Recent advances in applying deep reinforcement learning for flow control: Perspectives and future directions
Vignon, C.
Rabault, J.
Vinuesa, R.
PHYSICS OF FLUIDS, 2023, 35 (03)
[25] A Survey on recent advances in reinforcement learning for intelligent investment decision-making optimization
Wang, Feng
Li, Shicheng
Niu, Shanshui
Yang, Haoran
Li, Xiaodong
Deng, Xiaotie
EXPERT SYSTEMS WITH APPLICATIONS, 2025, 282
[26] Reinforcement Learning for Selective Key Applications in Power Systems: Recent Advances and Future Challenges
Chen, Xin
Qu, Guannan
Tang, Yujie
Low, Steven
Li, Na
IEEE TRANSACTIONS ON SMART GRID, 2022, 13 (04) : 2935 - 2958
[27] Recent Advances in Mechanical Reinforcement of Zwitterionic Hydrogels
Lin, Weifeng
Wei, Xinyue
Liu, Sihang
Zhang, Juan
Yang, Tian
Chen, Shengfu
GELS, 2022, 8 (09)
[28] Hierarchical Reinforcement Learning for Quadruped Locomotion
Jain, Deepali
Iscen, Atil
Caluwaerts, Ken
2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 7551 - 7557
[29] Reinforcement Active Learning Hierarchical Loops
Gordon, Goren
Ahissar, Ehud
2011 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2011, : 3008 - 3015
[30] Hierarchical Reinforcement Learning With Timed Subgoals
Guertler, Nico
Buechler, Dieter
Martius, Georg
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
