共 50 条
- [1] A Novel Q-learning Algorithm with Function Approximation for Constrained Markov Decision Processes 2012 50TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2012, : 400 - 405
- [3] Safe Q-Learning Method Based on Constrained Markov Decision Processes IEEE ACCESS, 2019, 7 : 165007 - 165017
- [6] Kernelized Q-Learning for Large-Scale, Potentially Continuous, Markov Decision Processes 2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018, : 153 - 162