Inverse Risk-Sensitive Reinforcement Learning

被引：16

作者：

Ratliff, Lillian J. ^{[1
]}

Mazumdar, Eric ^{[2
]}

机构：

[1] Univ Washington, Dept Elect Engn, Seattle, WA 98195 USA

[2] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA

来源：

IEEE TRANSACTIONS ON AUTOMATIC CONTROL | 2020年 / 65卷 / 03期

基金：

美国国家科学基金会;

关键词：

Autonomous systems; Markov processes; optimization; reinforcement learning; CHOICE;

D O I：

10.1109/TAC.2019.2926674

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This work addresses the problem of inverse reinforcement learning in Markov decision processes where the decision-making agent is risk-sensitive. In particular, a risk-sensitive reinforcement learning algorithm with convergence guarantees that makes use of coherent risk metrics and models of human decision-making which have their origins in behavioral psychology and economics is presented. The risk-sensitive reinforcement learning algorithm provides the theoretical underpinning for a gradient-based inverse reinforcement learning algorithm that seeks to minimize a loss function defined on the observed behavior. It is shown that the gradient of the loss function with respect to the model parameters is well defined and computable via a contraction map argument. Evaluation of the proposed technique is performed on a Grid World example, a canonical benchmark problem.

引用

页码：1256 / 1263

页数：8

共 50 条

[41] Sample-Efficient Multimodal Dynamics Modeling for Risk-Sensitive Reinforcement Learning
Yashima, Ryota
Yamaguchi, Akihiko
Hashimoto, Koichi
2022 8TH INTERNATIONAL CONFERENCE ON MECHATRONICS AND ROBOTICS ENGINEERING (ICMRE 2022), 2022, : 21 - 27
[42] RiskQ: Risk-sensitive Multi-Agent Reinforcement Learning Value Factorization
Shen, Siqi
Ma, Chennan
Li, Chao
Liu, Weiquan
Fu, Yongquan
Mei, Songzhu
Liu, Xinwang
Wang, Cheng
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[43] Risk-Sensitive Autonomous Exploration of Unknown Environments: A Deep Reinforcement Learning Perspective
Sarfi, Mohammad Hossein
Bisheban, Mahdis
JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2025, 111 (01)
[44] A Tighter Problem-Dependent Regret Bound for Risk-Sensitive Reinforcement Learning
Hu, Xiaoyan
Leung, Ho-Fung
INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 206, 2023, 206
[45] Sample-Efficient Multimodal Dynamics Modeling for Risk-Sensitive Reinforcement Learning
Yashima, Ryota
Yamaguchi, Akihiko
Hashimoto, Koichi
2022 8th International Conference on Mechatronics and Robotics Engineering, ICMRE 2022, 2022, : 21 - 27
[46] Embracing Risk in Reinforcement Learning: The Connection between Risk-Sensitive Exponential and Distributionally Robust Criteria
Noorani, Erfaun
Baras, John S.
2022 AMERICAN CONTROL CONFERENCE, ACC, 2022, : 2703 - 2708
[47] Risk-Sensitive Reinforcement Learning: Near-Optimal Risk-Sample Tradeoff in Regret
Fei, Yingjie
Yang, Zhuoran
Chen, Yudong
Wang, Zhaoran
Xie, Qiaomin
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
[48] A Learning Algorithm for Risk-Sensitive Cost
Basu, Arnab
Bhattacharyya, Tirthankar
Borkar, Vivek S.
MATHEMATICS OF OPERATIONS RESEARCH, 2008, 33 (04) : 880 - 898
[49] Policy Gradient Based Entropic-VaR Optimization in Risk-Sensitive Reinforcement Learning
Ni, Xinyi
Lai, Lifeng
2022 58TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2022,
[50] Influence of budget and reinforcement location on risk-sensitive preference
O'Daly, Matthew
Case, David A.
Fantino, Edmund
BEHAVIOURAL PROCESSES, 2006, 73 (02) : 125 - 135

← 1 2 3 4 5 →