共 50 条
- [1] A policy gradient reinforcement learning algorithm with fuzzy function approximation IEEE ROBIO 2004: Proceedings of the IEEE International Conference on Robotics and Biomimetics, 2004, : 936 - 940
- [2] Reinforcement Learning to Rank with Pairwise Policy Gradient PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20), 2020, : 509 - 518
- [6] A Residual Gradient Fuzzy Reinforcement Learning Algorithm for Differential Games International Journal of Fuzzy Systems, 2017, 19 : 1058 - 1076
- [8] Adaptive Natural Policy Gradient in Reinforcement Learning PROCEEDINGS OF 2018 IEEE 7TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE (DDCLS), 2018, : 605 - 610
- [9] Traffic Light Control with Policy Gradient-Based Reinforcement Learning 32ND IEEE SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU 2024, 2024,
- [10] Survey of Deep Reinforcement Learning Based on Value Function and Policy Gradient Jisuanji Xuebao/Chinese Journal of Computers, 2019, 42 (06): : 1406 - 1438