共 26 条
- [11] Multi-Agent Deep Reinforcement Learning for Multi-Robot Applications: A Survey[J]. SENSORS, 2023, 23 (07)Orr, James论文数: 0 引用数: 0 h-index: 0机构: Univ North Florida, Sch Comp, Jacksonville, FL 32224 USA Univ North Florida, Sch Comp, Jacksonville, FL 32224 USADutta, Ayan论文数: 0 引用数: 0 h-index: 0机构: Univ North Florida, Sch Comp, Jacksonville, FL 32224 USA Univ North Florida, Sch Comp, Jacksonville, FL 32224 USA
- [12] Schulman J, 2017, Arxiv, DOI arXiv:1707.06347
- [13] Silver D, 2014, PR MACH LEARN RES, V32
- [14] Mastering the game of Go without human knowledge[J]. NATURE, 2017, 550 (7676) : 354 - +Silver, David论文数: 0 引用数: 0 h-index: 0机构: DeepMind, 5 New St Sq, London EC4A 3TW, England DeepMind, 5 New St Sq, London EC4A 3TW, EnglandSchrittwieser, Julian论文数: 0 引用数: 0 h-index: 0机构: DeepMind, 5 New St Sq, London EC4A 3TW, England DeepMind, 5 New St Sq, London EC4A 3TW, EnglandSimonyan, Karen论文数: 0 引用数: 0 h-index: 0机构: DeepMind, 5 New St Sq, London EC4A 3TW, England DeepMind, 5 New St Sq, London EC4A 3TW, EnglandAntonoglou, Ioannis论文数: 0 引用数: 0 h-index: 0机构: DeepMind, 5 New St Sq, London EC4A 3TW, England DeepMind, 5 New St Sq, London EC4A 3TW, EnglandHuang, Aja论文数: 0 引用数: 0 h-index: 0机构: DeepMind, 5 New St Sq, London EC4A 3TW, England DeepMind, 5 New St Sq, London EC4A 3TW, EnglandGuez, Arthur论文数: 0 引用数: 0 h-index: 0机构: DeepMind, 5 New St Sq, London EC4A 3TW, England DeepMind, 5 New St Sq, London EC4A 3TW, EnglandHubert, Thomas论文数: 0 引用数: 0 h-index: 0机构: DeepMind, 5 New St Sq, London EC4A 3TW, England DeepMind, 5 New St Sq, London EC4A 3TW, EnglandBaker, Lucas论文数: 0 引用数: 0 h-index: 0机构: DeepMind, 5 New St Sq, London EC4A 3TW, England DeepMind, 5 New St Sq, London EC4A 3TW, EnglandLai, Matthew论文数: 0 引用数: 0 h-index: 0机构: DeepMind, 5 New St Sq, London EC4A 3TW, England DeepMind, 5 New St Sq, London EC4A 3TW, EnglandBolton, Adrian论文数: 0 引用数: 0 h-index: 0机构: DeepMind, 5 New St Sq, London EC4A 3TW, England DeepMind, 5 New St Sq, London EC4A 3TW, EnglandChen, Yutian论文数: 0 引用数: 0 h-index: 0机构: DeepMind, 5 New St Sq, London EC4A 3TW, England DeepMind, 5 New St Sq, London EC4A 3TW, EnglandLillicrap, Timothy论文数: 0 引用数: 0 h-index: 0机构: DeepMind, 5 New St Sq, London EC4A 3TW, England DeepMind, 5 New St Sq, London EC4A 3TW, EnglandHui, Fan论文数: 0 引用数: 0 h-index: 0机构: DeepMind, 5 New St Sq, London EC4A 3TW, England DeepMind, 5 New St Sq, London EC4A 3TW, EnglandSifre, Laurent论文数: 0 引用数: 0 h-index: 0机构: DeepMind, 5 New St Sq, London EC4A 3TW, England DeepMind, 5 New St Sq, London EC4A 3TW, Englandvan den Driessche, George论文数: 0 引用数: 0 h-index: 0机构: DeepMind, 5 New St Sq, London EC4A 3TW, England DeepMind, 5 New St Sq, London EC4A 3TW, EnglandGraepel, Thore论文数: 0 引用数: 0 h-index: 0机构: DeepMind, 5 New St Sq, London EC4A 3TW, England DeepMind, 5 New St Sq, London EC4A 3TW, EnglandHassabis, Demis论文数: 0 引用数: 0 h-index: 0机构: DeepMind, 5 New St Sq, London EC4A 3TW, England DeepMind, 5 New St Sq, London EC4A 3TW, England
- [15] Mastering the game of Go with deep neural networks and tree search[J]. NATURE, 2016, 529 (7587) : 484 - +Silver, David论文数: 0 引用数: 0 h-index: 0机构: Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, EnglandHuang, Aja论文数: 0 引用数: 0 h-index: 0机构: Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, EnglandMaddison, Chris J.论文数: 0 引用数: 0 h-index: 0机构: Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, EnglandGuez, Arthur论文数: 0 引用数: 0 h-index: 0机构: Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, EnglandSifre, Laurent论文数: 0 引用数: 0 h-index: 0机构: Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, Englandvan den Driessche, George论文数: 0 引用数: 0 h-index: 0机构: Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, EnglandSchrittwieser, Julian论文数: 0 引用数: 0 h-index: 0机构: Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, EnglandAntonoglou, Ioannis论文数: 0 引用数: 0 h-index: 0机构: Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, EnglandPanneershelvam, Veda论文数: 0 引用数: 0 h-index: 0机构: Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, EnglandLanctot, Marc论文数: 0 引用数: 0 h-index: 0机构: Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, EnglandDieleman, Sander论文数: 0 引用数: 0 h-index: 0机构: Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, EnglandGrewe, Dominik论文数: 0 引用数: 0 h-index: 0机构: Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, EnglandNham, John论文数: 0 引用数: 0 h-index: 0机构: Google, 1600 Amphitheatre Pkwy, Mountain View, CA 94043 USA Google DeepMind, 5 New St Sq, London EC4A 3TW, EnglandKalchbrenner, Nal论文数: 0 引用数: 0 h-index: 0机构: Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, EnglandSutskever, Ilya论文数: 0 引用数: 0 h-index: 0机构: Google, 1600 Amphitheatre Pkwy, Mountain View, CA 94043 USA Google DeepMind, 5 New St Sq, London EC4A 3TW, EnglandLillicrap, Timothy论文数: 0 引用数: 0 h-index: 0机构: Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, EnglandLeach, Madeleine论文数: 0 引用数: 0 h-index: 0机构: Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, EnglandKavukcuoglu, Koray论文数: 0 引用数: 0 h-index: 0机构: Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, EnglandGraepel, Thore论文数: 0 引用数: 0 h-index: 0机构: Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, EnglandHassabis, Demis论文数: 0 引用数: 0 h-index: 0机构: Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, England
- [16] Soleyman S., 2020, P AAAI S 2 WORKSH DE
- [17] Sutton RS, 2018, ADAPT COMPUT MACH LE, P1
- [18] Grandmaster level in StarCraft II using multi-agent reinforcement learning[J]. NATURE, 2019, 575 (7782) : 350 - +Vinyals, Oriol论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandBabuschkin, Igor论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandCzarnecki, Wojciech M.论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandMathieu, Michael论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandDudzik, Andrew论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandChung, Junyoung论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandChoi, David H.论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandPowell, Richard论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandEwalds, Timo论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandGeorgiev, Petko论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandOh, Junhyuk论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandHorgan, Dan论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandKroiss, Manuel论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandDanihelka, Ivo论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandHuang, Aja论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandSifre, Laurent论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandCai, Trevor论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandAgapiou, John P.论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandJaderberg, Max论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandVezhnevets, Alexander S.论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandLeblond, Remi论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandPohlen, Tobias论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandDalibard, Valentin论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandBudden, David论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandSulsky, Yury论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandMolloy, James论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandPaine, Tom L.论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandGulcehre, Caglar论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandWang, Ziyu论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandPfaff, Tobias论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandWu, Yuhuai论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandRing, Roman论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandYogatama, Dani论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandWunsch, Dario论文数: 0 引用数: 0 h-index: 0机构: Team Liquid, Utrecht, Netherlands DeepMind, London, EnglandMcKinney, Katrina论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandSmith, Oliver论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandSchaul, Tom论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandLillicrap, Timothy论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandKavukcuoglu, Koray论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandHassabis, Demis论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandApps, Chris论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandSilver, David论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, England
- [19] Wang X., 2023, P 2023 IEEE T AUT SC
- [20] Wu KD, 2016, 2016 IEEE CHINESE GUIDANCE, NAVIGATION AND CONTROL CONFERENCE (CGNCC), P930, DOI 10.1109/CGNCC.2016.7828910