共 50 条
- [1] Continuous Deep Q-Learning in Optimal Control Problems: Normalized Advantage Functions Analysis ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
- [6] Approximating the value function for continuous space reinforcement learning in robot control 2002 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-3, PROCEEDINGS, 2002, : 1062 - 1067
- [10] HIERARCHICAL REINFORCEMENT LEARNING WITH ADVANTAGE FUNCTION FOR ENTITY RELATION EXTRACTION Journal of Applied and Numerical Optimization, 2022, 4 (03): : 393 - 404