共 90 条
- [71] TEMPORAL DIFFERENCE LEARNING AND TD-GAMMON [J]. COMMUNICATIONS OF THE ACM, 1995, 38 (03) : 58 - 68
- [72] Thin L.N., 2016, Int. J. Comput. Network. Commun., V8, P123, DOI 10.5121/ijcnc.2016.8211
- [73] Thipphavong D. P., 2018, P AV TECHN INT OP C, P3676, DOI [10.2514/6.2018-3676, DOI 10.2514/6.2018-3676]
- [74] Van Rossum G., 2009, PYTHON 3 REFERENCE M
- [75] Waltz M., 2022, Rl dresden algorithm suite
- [76] Waltz M, 2024, Arxiv, DOI arXiv:2307.16769
- [77] Spatial-temporal recurrent reinforcement learning for autonomous ships [J]. NEURAL NETWORKS, 2023, 165 : 634 - 653
- [78] Distributed Reinforcement Learning for Robot Teams: a Review [J]. Current Robotics Reports, 2022, 3 (4): : 239 - 257
- [80] WHITLEY D, 1994, STAT COMPUT, V4, P65, DOI 10.1007/BF00175354