共 44 条
[31]
Ramachandran D, 2007, 20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P2586
[32]
Ross S., 2011, P MACHINE LEARNING R, P627
[33]
Mastering Atari, Go, chess and shogi by planning with a learned model
[J].
Schrittwieser, Julian
;
Antonoglou, Ioannis
;
Hubert, Thomas
;
Simonyan, Karen
;
Sifre, Laurent
;
Schmitt, Simon
;
Guez, Arthur
;
Lockhart, Edward
;
Hassabis, Demis
;
Graepel, Thore
;
Lillicrap, Timothy
;
Silver, David
.
NATURE,
2020, 588 (7839)
:604-+

Schrittwieser, Julian
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England DeepMind, London, England

Antonoglou, Ioannis
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England
UCL, London, England DeepMind, London, England

Hubert, Thomas
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England DeepMind, London, England

Simonyan, Karen
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England DeepMind, London, England

Sifre, Laurent
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England DeepMind, London, England

Schmitt, Simon
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England DeepMind, London, England

Guez, Arthur
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England DeepMind, London, England

Lockhart, Edward
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England DeepMind, London, England

Hassabis, Demis
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England DeepMind, London, England

Graepel, Thore
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England
UCL, London, England DeepMind, London, England

Lillicrap, Timothy
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England DeepMind, London, England

Silver, David
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England
UCL, London, England DeepMind, London, England
[34]
A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play
[J].
Silver, David
;
Hubert, Thomas
;
Schrittwieser, Julian
;
Antonoglou, Ioannis
;
Lai, Matthew
;
Guez, Arthur
;
Lanctot, Marc
;
Sifre, Laurent
;
Kumaran, Dharshan
;
Graepel, Thore
;
Lillicrap, Timothy
;
Simonyan, Karen
;
Hassabis, Demis
.
SCIENCE,
2018, 362 (6419)
:1140-+

Silver, David
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 6 Pancras Sq, London N1C 4AG, England
UCL, Gower St, London WC1E 6BT, England DeepMind, 6 Pancras Sq, London N1C 4AG, England

Hubert, Thomas
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 6 Pancras Sq, London N1C 4AG, England DeepMind, 6 Pancras Sq, London N1C 4AG, England

Schrittwieser, Julian
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 6 Pancras Sq, London N1C 4AG, England DeepMind, 6 Pancras Sq, London N1C 4AG, England

Antonoglou, Ioannis
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 6 Pancras Sq, London N1C 4AG, England DeepMind, 6 Pancras Sq, London N1C 4AG, England

Lai, Matthew
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 6 Pancras Sq, London N1C 4AG, England DeepMind, 6 Pancras Sq, London N1C 4AG, England

Guez, Arthur
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 6 Pancras Sq, London N1C 4AG, England DeepMind, 6 Pancras Sq, London N1C 4AG, England

Lanctot, Marc
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 6 Pancras Sq, London N1C 4AG, England DeepMind, 6 Pancras Sq, London N1C 4AG, England

Sifre, Laurent
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 6 Pancras Sq, London N1C 4AG, England DeepMind, 6 Pancras Sq, London N1C 4AG, England

Kumaran, Dharshan
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 6 Pancras Sq, London N1C 4AG, England DeepMind, 6 Pancras Sq, London N1C 4AG, England

Graepel, Thore
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 6 Pancras Sq, London N1C 4AG, England DeepMind, 6 Pancras Sq, London N1C 4AG, England

Lillicrap, Timothy
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 6 Pancras Sq, London N1C 4AG, England DeepMind, 6 Pancras Sq, London N1C 4AG, England

Simonyan, Karen
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 6 Pancras Sq, London N1C 4AG, England DeepMind, 6 Pancras Sq, London N1C 4AG, England

Hassabis, Demis
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 6 Pancras Sq, London N1C 4AG, England DeepMind, 6 Pancras Sq, London N1C 4AG, England
[35]
Mastering the game of Go without human knowledge
[J].
Silver, David
;
Schrittwieser, Julian
;
Simonyan, Karen
;
Antonoglou, Ioannis
;
Huang, Aja
;
Guez, Arthur
;
Hubert, Thomas
;
Baker, Lucas
;
Lai, Matthew
;
Bolton, Adrian
;
Chen, Yutian
;
Lillicrap, Timothy
;
Hui, Fan
;
Sifre, Laurent
;
van den Driessche, George
;
Graepel, Thore
;
Hassabis, Demis
.
NATURE,
2017, 550 (7676)
:354-+

Silver, David
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 5 New St Sq, London EC4A 3TW, England DeepMind, 5 New St Sq, London EC4A 3TW, England

Schrittwieser, Julian
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 5 New St Sq, London EC4A 3TW, England DeepMind, 5 New St Sq, London EC4A 3TW, England

Simonyan, Karen
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 5 New St Sq, London EC4A 3TW, England DeepMind, 5 New St Sq, London EC4A 3TW, England

Antonoglou, Ioannis
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 5 New St Sq, London EC4A 3TW, England DeepMind, 5 New St Sq, London EC4A 3TW, England

Huang, Aja
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 5 New St Sq, London EC4A 3TW, England DeepMind, 5 New St Sq, London EC4A 3TW, England

Guez, Arthur
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 5 New St Sq, London EC4A 3TW, England DeepMind, 5 New St Sq, London EC4A 3TW, England

Hubert, Thomas
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 5 New St Sq, London EC4A 3TW, England DeepMind, 5 New St Sq, London EC4A 3TW, England

Baker, Lucas
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 5 New St Sq, London EC4A 3TW, England DeepMind, 5 New St Sq, London EC4A 3TW, England

Lai, Matthew
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 5 New St Sq, London EC4A 3TW, England DeepMind, 5 New St Sq, London EC4A 3TW, England

Bolton, Adrian
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 5 New St Sq, London EC4A 3TW, England DeepMind, 5 New St Sq, London EC4A 3TW, England

Chen, Yutian
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 5 New St Sq, London EC4A 3TW, England DeepMind, 5 New St Sq, London EC4A 3TW, England

Lillicrap, Timothy
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 5 New St Sq, London EC4A 3TW, England DeepMind, 5 New St Sq, London EC4A 3TW, England

Hui, Fan
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 5 New St Sq, London EC4A 3TW, England DeepMind, 5 New St Sq, London EC4A 3TW, England

Sifre, Laurent
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 5 New St Sq, London EC4A 3TW, England DeepMind, 5 New St Sq, London EC4A 3TW, England

van den Driessche, George
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 5 New St Sq, London EC4A 3TW, England DeepMind, 5 New St Sq, London EC4A 3TW, England

Graepel, Thore
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 5 New St Sq, London EC4A 3TW, England DeepMind, 5 New St Sq, London EC4A 3TW, England

Hassabis, Demis
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 5 New St Sq, London EC4A 3TW, England DeepMind, 5 New St Sq, London EC4A 3TW, England
[36]
Mastering the game of Go with deep neural networks and tree search
[J].
Silver, David
;
Huang, Aja
;
Maddison, Chris J.
;
Guez, Arthur
;
Sifre, Laurent
;
van den Driessche, George
;
Schrittwieser, Julian
;
Antonoglou, Ioannis
;
Panneershelvam, Veda
;
Lanctot, Marc
;
Dieleman, Sander
;
Grewe, Dominik
;
Nham, John
;
Kalchbrenner, Nal
;
Sutskever, Ilya
;
Lillicrap, Timothy
;
Leach, Madeleine
;
Kavukcuoglu, Koray
;
Graepel, Thore
;
Hassabis, Demis
.
NATURE,
2016, 529 (7587)
:484-+

Silver, David
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, England

Huang, Aja
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, England

Maddison, Chris J.
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, England

Guez, Arthur
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, England

Sifre, Laurent
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, England

van den Driessche, George
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, England

Schrittwieser, Julian
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, England

Antonoglou, Ioannis
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, England

Panneershelvam, Veda
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, England

Lanctot, Marc
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, England

Dieleman, Sander
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, England

Grewe, Dominik
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, England

Nham, John
论文数: 0 引用数: 0
h-index: 0
机构:
Google, 1600 Amphitheatre Pkwy, Mountain View, CA 94043 USA Google DeepMind, 5 New St Sq, London EC4A 3TW, England

Kalchbrenner, Nal
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, England

Sutskever, Ilya
论文数: 0 引用数: 0
h-index: 0
机构:
Google, 1600 Amphitheatre Pkwy, Mountain View, CA 94043 USA Google DeepMind, 5 New St Sq, London EC4A 3TW, England

Lillicrap, Timothy
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, England

Leach, Madeleine
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, England

Kavukcuoglu, Koray
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, England

Graepel, Thore
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, England

Hassabis, Demis
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, England
[37]
TrafficSim: Learning to Simulate Realistic Multi-Agent Behaviors
[J].
Suo, Simon
;
Regalado, Sebastian
;
Casas, Sergio
;
Urtasun, Raquel
.
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021,
2021,
:10395-10404

Suo, Simon
论文数: 0 引用数: 0
h-index: 0
机构:
Uber ATG, Pittsburgh, PA 15201 USA
Univ Toronto, Toronto, ON, Canada Uber ATG, Pittsburgh, PA 15201 USA

Regalado, Sebastian
论文数: 0 引用数: 0
h-index: 0
机构:
Univ Waterloo, Waterloo, ON, Canada Uber ATG, Pittsburgh, PA 15201 USA

Casas, Sergio
论文数: 0 引用数: 0
h-index: 0
机构:
Uber ATG, Pittsburgh, PA 15201 USA
Univ Toronto, Toronto, ON, Canada Uber ATG, Pittsburgh, PA 15201 USA

Urtasun, Raquel
论文数: 0 引用数: 0
h-index: 0
机构:
Uber ATG, Pittsburgh, PA 15201 USA
Univ Toronto, Toronto, ON, Canada Uber ATG, Pittsburgh, PA 15201 USA
[38]
Sutton RS, 2018, ADAPT COMPUT MACH LE, P1
[39]
Tamar A., 2016, ADV NEURAL INFORM PR
[40]
Tesauro G, 1997, ADV NEUR IN, V9, P1068