50 items in total
- [1] Model-free Safe Reinforcement Learning Method Based on Constrained Markov Decision Processes Ruan Jian Xue Bao/Journal of Software, 2022, 33 (08): 3086-3102
- [3] Risk-aware Q-Learning for Markov Decision Processes 2017 IEEE 56TH ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2017
- [7] Reinforcement Learning for Constrained Markov Decision Processes 24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130
- [8] Optimal method for the generation of the attack path based on the Q-learning decision Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2021, 48 (01): 160-167
- [9] Constrained Deep Q-Learning Gradually Approaching Ordinary Q-Learning FRONTIERS IN NEUROROBOTICS, 2019, 13
- [10] Optimal operational control for industrial processes based on Q-learning method PROCEEDINGS OF THE 36TH CHINESE CONTROL CONFERENCE (CCC 2017), 2017: 2562-2567