共 36 条
[1]
Alexandridis KP, 2024, Arxiv, DOI arXiv:2407.08567
[2]
Ashish R., 2024, Pattern Anal. Appl., V27
[4]
Berradi Y., 2018, Learning and Optimization Algorithms: Theory and Applications
[7]
Clevert DA, 2016, Arxiv, DOI arXiv:1511.07289
[8]
A Modular Robotic Arm Configuration Design Method Based on Double DQN with Prioritized Experience Replay
[J].
SYMMETRY-BASEL,
2024, 16 (06)
[10]
Proximal Policy Optimization With Policy Feedback
[J].
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS,
2022, 52 (07)
:4600-4610