Regret Bounds for Risk-Sensitive Reinforcement Learning
被引:0
作者:
Bastani, Osbert
论文数: 0引用数: 0
h-index: 0
机构:
Univ Penn, Philadelphia, PA 19104 USAUniv Penn, Philadelphia, PA 19104 USA
Bastani, Osbert
[1
]
Ma, Yecheng Jason
论文数: 0引用数: 0
h-index: 0
机构:
Univ Penn, Philadelphia, PA 19104 USAUniv Penn, Philadelphia, PA 19104 USA
Ma, Yecheng Jason
[1
]
Shen, Estelle
论文数: 0引用数: 0
h-index: 0
机构:
Univ Penn, Philadelphia, PA 19104 USAUniv Penn, Philadelphia, PA 19104 USA
Shen, Estelle
[1
]
Xu, Wanqiao
论文数: 0引用数: 0
h-index: 0
机构:
Stanford Univ, Stanford, CA USAUniv Penn, Philadelphia, PA 19104 USA
Xu, Wanqiao
[2
]
机构:
[1] Univ Penn, Philadelphia, PA 19104 USA
[2] Stanford Univ, Stanford, CA USA
来源:
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022
|
2022年
关键词:
VALUE-AT-RISK;
D O I:
暂无
中图分类号:
TP18 [人工智能理论];
学科分类号:
081104 ;
0812 ;
0835 ;
1405 ;
摘要:
In safety-critical applications of reinforcement learning such as healthcare and robotics, it is often desirable to optimize risk-sensitive objectives that account for tail outcomes rather than expected reward. We prove the first regret bounds for reinforcement learning under a general class of risk-sensitive objectives including the popular CVaR objective. Our theory is based on a novel characterization of the CVaR objective as well as a novel optimistic MDP construction.
机构:
Indian Inst Technol Madras, Dept Elect Engn, Chennai 600036, Tamil Nadu, IndiaIndian Inst Technol Madras, Dept Elect Engn, Chennai 600036, Tamil Nadu, India
Kolla, Ravi Kumar
Prashanth, L. A.
论文数: 0引用数: 0
h-index: 0
机构:
Indian Inst Technol Madras, Dept Comp Sci & Engn, Chennai 600036, Tamil Nadu, IndiaIndian Inst Technol Madras, Dept Elect Engn, Chennai 600036, Tamil Nadu, India
Prashanth, L. A.
Bhat, Sanjay P.
论文数: 0引用数: 0
h-index: 0
机构:
Tata Consultancy Serv Ltd, Hyderabad 500081, Telangana, IndiaIndian Inst Technol Madras, Dept Elect Engn, Chennai 600036, Tamil Nadu, India
Bhat, Sanjay P.
Jagannathan, Krishna
论文数: 0引用数: 0
h-index: 0
机构:
Indian Inst Technol Madras, Dept Elect Engn, Chennai 600036, Tamil Nadu, IndiaIndian Inst Technol Madras, Dept Elect Engn, Chennai 600036, Tamil Nadu, India
机构:
Georgia Inst Technol, Sch Math, Atlanta, GA 30332 USA
Univ Waterloo, Dept Stat & Actuarial Sci, Waterloo, ON N2L 3G1, CanadaGeorgia Inst Technol, Sch Math, Atlanta, GA 30332 USA
Wang, Ruodu
Peng, Liang
论文数: 0引用数: 0
h-index: 0
机构:
Georgia Inst Technol, Sch Math, Atlanta, GA 30332 USAGeorgia Inst Technol, Sch Math, Atlanta, GA 30332 USA
Peng, Liang
Yang, Jingping
论文数: 0引用数: 0
h-index: 0
机构:
Peking Univ, Ctr Stat Sci, Dept Financial Math, LMEQF, Beijing 100871, Peoples R China
Peking Univ, Ctr Stat Sci, Dept Financial Math, LMAM, Beijing 100871, Peoples R ChinaGeorgia Inst Technol, Sch Math, Atlanta, GA 30332 USA
机构:
Indian Inst Technol Madras, Dept Elect Engn, Chennai 600036, Tamil Nadu, IndiaIndian Inst Technol Madras, Dept Elect Engn, Chennai 600036, Tamil Nadu, India
Kolla, Ravi Kumar
Prashanth, L. A.
论文数: 0引用数: 0
h-index: 0
机构:
Indian Inst Technol Madras, Dept Comp Sci & Engn, Chennai 600036, Tamil Nadu, IndiaIndian Inst Technol Madras, Dept Elect Engn, Chennai 600036, Tamil Nadu, India
Prashanth, L. A.
Bhat, Sanjay P.
论文数: 0引用数: 0
h-index: 0
机构:
Tata Consultancy Serv Ltd, Hyderabad 500081, Telangana, IndiaIndian Inst Technol Madras, Dept Elect Engn, Chennai 600036, Tamil Nadu, India
Bhat, Sanjay P.
Jagannathan, Krishna
论文数: 0引用数: 0
h-index: 0
机构:
Indian Inst Technol Madras, Dept Elect Engn, Chennai 600036, Tamil Nadu, IndiaIndian Inst Technol Madras, Dept Elect Engn, Chennai 600036, Tamil Nadu, India
机构:
Georgia Inst Technol, Sch Math, Atlanta, GA 30332 USA
Univ Waterloo, Dept Stat & Actuarial Sci, Waterloo, ON N2L 3G1, CanadaGeorgia Inst Technol, Sch Math, Atlanta, GA 30332 USA
Wang, Ruodu
Peng, Liang
论文数: 0引用数: 0
h-index: 0
机构:
Georgia Inst Technol, Sch Math, Atlanta, GA 30332 USAGeorgia Inst Technol, Sch Math, Atlanta, GA 30332 USA
Peng, Liang
Yang, Jingping
论文数: 0引用数: 0
h-index: 0
机构:
Peking Univ, Ctr Stat Sci, Dept Financial Math, LMEQF, Beijing 100871, Peoples R China
Peking Univ, Ctr Stat Sci, Dept Financial Math, LMAM, Beijing 100871, Peoples R ChinaGeorgia Inst Technol, Sch Math, Atlanta, GA 30332 USA