50 entries in total
- [42] Model-Based Reinforcement Learning via Proximal Policy Optimization. 2019 Chinese Automation Congress (CAC2019), 2019: 4736-4740
- [44] Policy regularization for legible behavior. Neural Computing & Applications, 2023, 35 (23): 16781-16790
- [45] Policy regularization for legible behavior. Neural Computing and Applications, 2023, 35: 16781-16790
- [46] Cautious policy programming: exploiting KL regularization for monotonic policy improvement in reinforcement learning. Machine Learning, 2023, 112: 4527-4562
- [48] Tuning Proximal Policy Optimization Algorithm in Maze Solving with ML-Agents. Advances in Computing and Data Sciences (ICACDS 2022), Pt II, 2022, 1614: 248-262
- [50] Federated proximal policy optimization with action masking: Application in collective heating systems. Energy and AI, 2025, 20