共 26 条
- [2] Busoniu L, 2010, STUD COMPUT INTELL, V310, P183
- [3] Hessel M, 2018, AAAI CONF ARTIF INTE, P3215
- [4] Reinforcement learning: A survey [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 1996, 4 : 237 - 285
- [5] Källström J, 2020, IEEE SYS MAN CYBERN, P2157, DOI [10.1109/smc42975.2020.9283492, 10.1109/SMC42975.2020.9283492]
- [7] The Design of Simulation System for Multi-UAV Cooperative Guidance [J]. 2015 FIFTH INTERNATIONAL CONFERENCE ON INSTRUMENTATION AND MEASUREMENT, COMPUTER, COMMUNICATION AND CONTROL (IMCCC), 2015, : 1250 - 1254
- [8] Mnih V, 2016, PR MACH LEARN RES, V48
- [9] Human-level control through deep reinforcement learning [J]. NATURE, 2015, 518 (7540) : 529 - 533
- [10] Openai C., 2019, arXiv, DOI DOI 10.48550/ARXIV.1912.06680