A Review of Inverse Reinforcement Learning Theory and Recent Advances

被引:0
|
作者
Shao Zhifei [1 ]
Joo, Er Meng [1 ]
机构
[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore, Singapore
来源
2012 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC) | 2012年
关键词
Reinforcement learning; inverse reinforcement learning; reward function; expert demonstration; ROBOT;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A major challenge faced by machine learning community is the decision making problems under uncertainty. Reinforcement Learning (RL) techniques provide a powerful solution for it. An agent used by RL interacts with a dynamic environment and finds a policy through a reward function, without using target labels like Supervised Learning (SL). However, one fundamental assumption of existing RL algorithms is that reward function, the most succinct representation of the designer's intention, needs to be provided beforehand. In practice, the reward function can be very hard to specify and exhaustive to tune for large and complex problems, and this inspires the development of Inverse Reinforcement Learning (IRL), an extension of RL, which directly tackles this problem by learning the reward function through expert demonstrations. IRL introduces a new way of learning policies by deriving expert's intentions, in contrast to directly learning policies, which can be redundant and have poor generalization ability. In this paper, the original IRL algorithms and its close variants, as well as their recent advances are reviewed and compared.
引用
收藏
页数:8
相关论文
共 50 条
  • [41] Inverse Constrained Reinforcement Learning
    Malik, Shehryar
    Anwar, Usman
    Aghasi, Alireza
    Ahmed, Ali
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [42] Inverse reinforcement learning with evaluation
    da Silva, Valdinei Freire
    Reali Costa, Anna Helena
    Lima, Pedro
    2006 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), VOLS 1-10, 2006, : 4246 - +
  • [43] Identifiability in inverse reinforcement learning
    Cao, Haoyang
    Cohen, Samuel N.
    Szpruch, Lukasz
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [44] A survey of inverse reinforcement learning
    Adams, Stephen
    Cody, Tyler
    Beling, Peter A.
    ARTIFICIAL INTELLIGENCE REVIEW, 2022, 55 (06) : 4307 - 4346
  • [45] Survey on Inverse Reinforcement Learning
    Zhang L.-H.
    Liu Q.
    Huang Z.-G.
    Zhu F.
    Ruan Jian Xue Bao/Journal of Software, 2023, 34 (10): : 4772 - 4803
  • [46] A survey of inverse reinforcement learning
    Stephen Adams
    Tyler Cody
    Peter A. Beling
    Artificial Intelligence Review, 2022, 55 : 4307 - 4346
  • [47] Decentralized multi-agent reinforcement learning with networked agents: recent advances
    Zhang, Kaiqing
    Yang, Zhuoran
    Basar, Tamer
    FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2021, 22 (06) : 802 - 814
  • [48] Advances in inverse galois theory
    Matzat, BH
    ALGORITHMIC ALGEBRA AND NUMBER THEORY, 1998, : 79 - 102
  • [49] Recent advances in generalized thermoelasticity theory and the modified models: a review
    Shakeriaski, Farshad
    Ghodrat, Maryam
    Escobedo-Diaz, Juan
    Behnia, Masud
    JOURNAL OF COMPUTATIONAL DESIGN AND ENGINEERING, 2021, 8 (01) : 15 - 35
  • [50] Cognitive Dynamic Systems: A Review of Theory, Applications, and Recent Advances
    Hilal, Waleed
    Gadsden, S. Andrew
    Yawney, John
    PROCEEDINGS OF THE IEEE, 2023, 111 (06) : 575 - 622