62 entries in total
[1]
Alotaibi ETS, 2016, INT J ADV COMPUT SC, V7, P179
[3]
Azar M.G., 2011, ADV NEURAL INF PROCE, V2011, P2411
[4]
Multi-Robot Path Planning Method Using Reinforcement Learning [J]. APPLIED SCIENCES-BASEL, 2019, 9 (15)
[5]
Chen L., 2019, THESIS BEIJING JIAOT
[8]
Chu J., 2022, CHINAS IND INFORM, V28, P40, DOI 10.19609/j.cnki.cn10-1299/f.2022.04.010
[9]
Watkins C.J.C.H., 1989, LEARNING DELAYED REW