共 50 条
- [21] Risk-constrained Markov Decision Processes 49TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2010, : 2664 - 2669
- [22] Intelligent Decision Method of Slope Perturbing Based on Q-Learning for Anti-Deception Jamming 2022 6TH INTERNATIONAL CONFERENCE ON IMAGING, SIGNAL PROCESSING AND COMMUNICATIONS, ICISPC, 2022, : 71 - 76
- [23] Dynamic programming in constrained Markov decision processes CONTROL AND CYBERNETICS, 2006, 35 (03): : 645 - 660
- [24] A type of Q-learning method based on Elman network Proceedings of 2004 Chinese Control and Decision Conference, 2004, : 562 - 564
- [27] Safe Exploration in Finite Markov Decision Processes with Gaussian Processes ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29