共 50 条
[24]
Non-stationary Risk-Sensitive Reinforcement Learning: Near-Optimal Dynamic Regret, Adaptive Detection, and Separation Design
[J].
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 6,
2023,
:7405-7413
[25]
Extreme Risk Averse Policy for Goal-Directed Risk-Sensitive Markov Decision Process
[J].
PROCEEDINGS OF 2016 5TH BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS 2016),
2016,
:79-84
[26]
RISK-SENSITIVE AVERAGE OPTIMALITY IN MARKOV DECISION PROCESSES
[J].
KYBERNETIKA,
2018, 54 (06)
:1218-1230
[27]
Variational Policy Gradient Method for Reinforcement Learning with General Utilities
[J].
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020,
2020, 33