共 9 条
- [1] Berner C., 2019, ARXIV
- [2] Betz J., 2022, ARXIV
- [3] Brockman G, 2016, Arxiv, DOI [arXiv:1606.01540, DOI 10.48550/ARXIV.1606.01540]
- [4] DeepStack: Expert-level artificial intelligence in heads-up no-limit poker[J]. SCIENCE, 2017, 356 (6337) : 508 - +Moravcik, Matej论文数: 0 引用数: 0 h-index: 0机构: Univ Alberta, Dept Comp Sci, Edmonton, AB T6G 2E8, Canada Charles Univ Prague, Dept Appl Math, Prague, Czech Republic Univ Alberta, Dept Comp Sci, Edmonton, AB T6G 2E8, CanadaSchmid, Martin论文数: 0 引用数: 0 h-index: 0机构: Univ Alberta, Dept Comp Sci, Edmonton, AB T6G 2E8, Canada Charles Univ Prague, Dept Appl Math, Prague, Czech Republic Univ Alberta, Dept Comp Sci, Edmonton, AB T6G 2E8, CanadaBurch, Neil论文数: 0 引用数: 0 h-index: 0机构: Univ Alberta, Dept Comp Sci, Edmonton, AB T6G 2E8, Canada Univ Alberta, Dept Comp Sci, Edmonton, AB T6G 2E8, CanadaLisy, Viliam论文数: 0 引用数: 0 h-index: 0机构: Univ Alberta, Dept Comp Sci, Edmonton, AB T6G 2E8, Canada Czech Tech Univ, Dept Comp Sci, Fac Elect Engn, Prague, Czech Republic Univ Alberta, Dept Comp Sci, Edmonton, AB T6G 2E8, CanadaMorrill, Dustin论文数: 0 引用数: 0 h-index: 0机构: Univ Alberta, Dept Comp Sci, Edmonton, AB T6G 2E8, Canada Univ Alberta, Dept Comp Sci, Edmonton, AB T6G 2E8, CanadaBard, Nolan论文数: 0 引用数: 0 h-index: 0机构: Univ Alberta, Dept Comp Sci, Edmonton, AB T6G 2E8, Canada Univ Alberta, Dept Comp Sci, Edmonton, AB T6G 2E8, CanadaDavis, Trevor论文数: 0 引用数: 0 h-index: 0机构: Univ Alberta, Dept Comp Sci, Edmonton, AB T6G 2E8, Canada Univ Alberta, Dept Comp Sci, Edmonton, AB T6G 2E8, CanadaWaugh, Kevin论文数: 0 引用数: 0 h-index: 0机构: Univ Alberta, Dept Comp Sci, Edmonton, AB T6G 2E8, Canada Univ Alberta, Dept Comp Sci, Edmonton, AB T6G 2E8, CanadaJohanson, Michael论文数: 0 引用数: 0 h-index: 0机构: Univ Alberta, Dept Comp Sci, Edmonton, AB T6G 2E8, Canada Univ Alberta, Dept Comp Sci, Edmonton, AB T6G 2E8, CanadaBowling, Michael论文数: 0 引用数: 0 h-index: 0机构: Univ Alberta, Dept Comp Sci, Edmonton, AB T6G 2E8, Canada Univ Alberta, Dept Comp Sci, Edmonton, AB T6G 2E8, Canada
- [5] Puterman ML., 1994, Markov decision processes: discrete stochastic dynamic programming. Series in probability and statistics, DOI [10.1002/9780470316887, DOI 10.1002/9780470316887]
- [6] Mastering the game of Go with deep neural networks and tree search[J]. NATURE, 2016, 529 (7587) : 484 - +Silver, David论文数: 0 引用数: 0 h-index: 0机构: Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, EnglandHuang, Aja论文数: 0 引用数: 0 h-index: 0机构: Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, EnglandMaddison, Chris J.论文数: 0 引用数: 0 h-index: 0机构: Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, EnglandGuez, Arthur论文数: 0 引用数: 0 h-index: 0机构: Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, EnglandSifre, Laurent论文数: 0 引用数: 0 h-index: 0机构: Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, Englandvan den Driessche, George论文数: 0 引用数: 0 h-index: 0机构: Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, EnglandSchrittwieser, Julian论文数: 0 引用数: 0 h-index: 0机构: Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, EnglandAntonoglou, Ioannis论文数: 0 引用数: 0 h-index: 0机构: Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, EnglandPanneershelvam, Veda论文数: 0 引用数: 0 h-index: 0机构: Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, EnglandLanctot, Marc论文数: 0 引用数: 0 h-index: 0机构: Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, EnglandDieleman, Sander论文数: 0 引用数: 0 h-index: 0机构: Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, EnglandGrewe, Dominik论文数: 0 引用数: 0 h-index: 0机构: Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, EnglandNham, John论文数: 0 引用数: 0 h-index: 0机构: Google, 1600 Amphitheatre Pkwy, Mountain View, CA 94043 USA Google DeepMind, 5 New St Sq, London EC4A 3TW, EnglandKalchbrenner, Nal论文数: 0 引用数: 0 h-index: 0机构: Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, EnglandSutskever, Ilya论文数: 0 引用数: 0 h-index: 0机构: Google, 1600 Amphitheatre Pkwy, Mountain View, CA 94043 USA Google DeepMind, 5 New St Sq, London EC4A 3TW, EnglandLillicrap, Timothy论文数: 0 引用数: 0 h-index: 0机构: Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, EnglandLeach, Madeleine论文数: 0 引用数: 0 h-index: 0机构: Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, EnglandKavukcuoglu, Koray论文数: 0 引用数: 0 h-index: 0机构: Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, EnglandGraepel, Thore论文数: 0 引用数: 0 h-index: 0机构: Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, EnglandHassabis, Demis论文数: 0 引用数: 0 h-index: 0机构: Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, England
- [7] Sutton R.S., 2005, IEEE Trans. Neural Netw., V16, P285
- [8] Grandmaster level in StarCraft II using multi-agent reinforcement learning[J]. NATURE, 2019, 575 (7782) : 350 - +Vinyals, Oriol论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandBabuschkin, Igor论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandCzarnecki, Wojciech M.论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandMathieu, Michael论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandDudzik, Andrew论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandChung, Junyoung论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandChoi, David H.论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandPowell, Richard论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandEwalds, Timo论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandGeorgiev, Petko论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandOh, Junhyuk论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandHorgan, Dan论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandKroiss, Manuel论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandDanihelka, Ivo论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandHuang, Aja论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandSifre, Laurent论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandCai, Trevor论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandAgapiou, John P.论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandJaderberg, Max论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandVezhnevets, Alexander S.论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandLeblond, Remi论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandPohlen, Tobias论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandDalibard, Valentin论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandBudden, David论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandSulsky, Yury论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandMolloy, James论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandPaine, Tom L.论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandGulcehre, Caglar论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandWang, Ziyu论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandPfaff, Tobias论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandWu, Yuhuai论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandRing, Roman论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandYogatama, Dani论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandWunsch, Dario论文数: 0 引用数: 0 h-index: 0机构: Team Liquid, Utrecht, Netherlands DeepMind, London, EnglandMcKinney, Katrina论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandSmith, Oliver论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandSchaul, Tom论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandLillicrap, Timothy论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandKavukcuoglu, Koray论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandHassabis, Demis论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandApps, Chris论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, EnglandSilver, David论文数: 0 引用数: 0 h-index: 0机构: DeepMind, London, England DeepMind, London, England
- [9] Outracing champion Gran Turismo drivers with deep reinforcement learning[J]. NATURE, 2022, 602 (7896) : 223 - +Wurman, Peter R.论文数: 0 引用数: 0 h-index: 0机构: Sony AI, New York, NY 10036 USA Sony AI, New York, NY 10036 USABarrett, Samuel论文数: 0 引用数: 0 h-index: 0机构: Sony AI, New York, NY 10036 USA Sony AI, New York, NY 10036 USAKawamoto, Kenta论文数: 0 引用数: 0 h-index: 0机构: Sony AI, Tokyo, Japan Sony AI, New York, NY 10036 USAMacGlashan, James论文数: 0 引用数: 0 h-index: 0机构: Sony AI, New York, NY 10036 USA Sony AI, New York, NY 10036 USASubramanian, Kaushik论文数: 0 引用数: 0 h-index: 0机构: Sony AI, Zurich, Switzerland Sony AI, New York, NY 10036 USAWalsh, Thomas J.论文数: 0 引用数: 0 h-index: 0机构: Sony AI, New York, NY 10036 USA Sony AI, New York, NY 10036 USACapobianco, Roberto论文数: 0 引用数: 0 h-index: 0机构: Sony AI, Zurich, Switzerland Sony AI, New York, NY 10036 USADevlic, Alisa论文数: 0 引用数: 0 h-index: 0机构: Sony AI, Zurich, Switzerland Sony AI, New York, NY 10036 USAEckert, Franziska论文数: 0 引用数: 0 h-index: 0机构: Sony AI, Zurich, Switzerland Sony AI, New York, NY 10036 USAFuchs, Florian论文数: 0 引用数: 0 h-index: 0机构: Sony AI, Zurich, Switzerland Sony AI, New York, NY 10036 USAGilpin, Leilani论文数: 0 引用数: 0 h-index: 0机构: Sony AI, New York, NY 10036 USA Sony AI, New York, NY 10036 USAKhandelwal, Piyush论文数: 0 引用数: 0 h-index: 0机构: Sony AI, New York, NY 10036 USA Sony AI, New York, NY 10036 USAKompella, Varun论文数: 0 引用数: 0 h-index: 0机构: Sony AI, New York, NY 10036 USA Sony AI, New York, NY 10036 USALin, HaoChih论文数: 0 引用数: 0 h-index: 0机构: Sony AI, Zurich, Switzerland Sony AI, New York, NY 10036 USAMacAlpine, Patrick论文数: 0 引用数: 0 h-index: 0机构: Sony AI, New York, NY 10036 USA Sony AI, New York, NY 10036 USAOller, Declan论文数: 0 引用数: 0 h-index: 0机构: Sony AI, New York, NY 10036 USA Sony AI, New York, NY 10036 USASeno, Takuma论文数: 0 引用数: 0 h-index: 0机构: Sony AI, Tokyo, Japan Sony AI, New York, NY 10036 USASherstan, Craig论文数: 0 引用数: 0 h-index: 0机构: Sony AI, New York, NY 10036 USA Sony AI, New York, NY 10036 USAThomure, Michael D.论文数: 0 引用数: 0 h-index: 0机构: Sony AI, New York, NY 10036 USA Sony AI, New York, NY 10036 USAAghabozorgi, Houmehr论文数: 0 引用数: 0 h-index: 0机构: Sony AI, New York, NY 10036 USA Sony AI, New York, NY 10036 USABarrett, Leon论文数: 0 引用数: 0 h-index: 0机构: Sony AI, New York, NY 10036 USA Sony AI, New York, NY 10036 USADouglas, Rory论文数: 0 引用数: 0 h-index: 0机构: Sony AI, New York, NY 10036 USA Sony AI, New York, NY 10036 USAWhitehead, Dion论文数: 0 引用数: 0 h-index: 0机构: Sony AI, New York, NY 10036 USA Sony AI, New York, NY 10036 USADuerr, Peter论文数: 0 引用数: 0 h-index: 0机构: Sony AI, Zurich, Switzerland Sony AI, New York, NY 10036 USAStone, Peter论文数: 0 引用数: 0 h-index: 0机构: Sony AI, New York, NY 10036 USA Sony AI, New York, NY 10036 USASpranger, Michael论文数: 0 引用数: 0 h-index: 0机构: Sony AI, Tokyo, Japan Sony AI, New York, NY 10036 USAKitano, Hiroaki论文数: 0 引用数: 0 h-index: 0机构: Sony AI, Tokyo, Japan Sony AI, New York, NY 10036 USA