共 28 条
[2]
Potential-based reward shaping using state-space segmentation for efficiency in reinforcement learning
[J].
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE,
2024, 157
:469-484
[3]
Bernabei M, 2023, Global Journal of Flexible Systems Management, V24, P67, DOI [10.1007/s40171-022-00328-7, 10.1007/s40171-022-00328-7, DOI 10.1007/S40171-022-00328-7]
[5]
Chellaboina S, 2022, 2022 INT C ADV TECHN, P1, DOI [10.1109/ICONAT53423.2022.9725880, DOI 10.1109/ICONAT53423.2022.9725880]
[8]
Proximal Policy Optimization With Policy Feedback
[J].
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS,
2022, 52 (07)
:4600-4610
[10]
Huang J. -P., 2024, IEEE Transactions on Automation Science and Engineering