共 47 条
- [1] Bai H(2010)Cooperative load transport: A formation-control perspective IEEE Trans Robot 26 742-750
- [2] Wen JT(2010)Small satellites for global coverage: Potential and limits ISPRS Journal of photogrammetry and Remote Sensing 65 492-504
- [3] Sandau R(2013)Coordinating heterogeneous teams of robots using temporal symbolic planning Auton Robot 34 277-294
- [4] Brieß K(1997)Reinforcement learning in the multi-robot domain Auton Robot 4 73-83
- [5] D’Errico M(2015)Human-level control through deep reinforcement learning Nature 518 529-864
- [6] Wurm KM(2016)Reinforcement learning with multiple shared rewards Procedia Computer Science 80 855-338
- [7] Dornhege C(2008)Analyzing and visualizing multiagent rewards in dynamic and stochastic domains Auton Agent Multi-Agent Syst 17 320-256
- [8] Nebel B(2017)Multiagent cooperation and competition with deep reinforcement learning PloS one 12 e0172395-40
- [9] Burgard W(1992)Simple statistical gradient-following algorithms for connectionist reinforcement learning Machine learning 8 229-4347
- [10] Stachniss C(2016)True online temporal-difference learning J Mach Learn Res 17 1-188