共 7 条
[1]
Busoniu L, 2010, STUD COMPUT INTELL, V310, P183
[2]
Mnih V., 2013, ARXIV, V1312, P5602, DOI DOI 10.48550/ARXIV.1312.5602
[3]
Human-level control through deep reinforcement learning
[J].
Mnih, Volodymyr
;
Kavukcuoglu, Koray
;
Silver, David
;
Rusu, Andrei A.
;
Veness, Joel
;
Bellemare, Marc G.
;
Graves, Alex
;
Riedmiller, Martin
;
Fidjeland, Andreas K.
;
Ostrovski, Georg
;
Petersen, Stig
;
Beattie, Charles
;
Sadik, Amir
;
Antonoglou, Ioannis
;
King, Helen
;
Kumaran, Dharshan
;
Wierstra, Daan
;
Legg, Shane
;
Hassabis, Demis
.
NATURE,
2015, 518 (7540)
:529-533

Mnih, Volodymyr
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, London EC4A 3TW, England Google DeepMind, London EC4A 3TW, England

Kavukcuoglu, Koray
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, London EC4A 3TW, England Google DeepMind, London EC4A 3TW, England

Silver, David
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, London EC4A 3TW, England Google DeepMind, London EC4A 3TW, England

Rusu, Andrei A.
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, London EC4A 3TW, England Google DeepMind, London EC4A 3TW, England

Veness, Joel
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, London EC4A 3TW, England Google DeepMind, London EC4A 3TW, England

Bellemare, Marc G.
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, London EC4A 3TW, England Google DeepMind, London EC4A 3TW, England

Graves, Alex
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, London EC4A 3TW, England Google DeepMind, London EC4A 3TW, England

Riedmiller, Martin
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, London EC4A 3TW, England Google DeepMind, London EC4A 3TW, England

Fidjeland, Andreas K.
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, London EC4A 3TW, England Google DeepMind, London EC4A 3TW, England

Ostrovski, Georg
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, London EC4A 3TW, England Google DeepMind, London EC4A 3TW, England

Petersen, Stig
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, London EC4A 3TW, England Google DeepMind, London EC4A 3TW, England

Beattie, Charles
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, London EC4A 3TW, England Google DeepMind, London EC4A 3TW, England

Sadik, Amir
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, London EC4A 3TW, England Google DeepMind, London EC4A 3TW, England

Antonoglou, Ioannis
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, London EC4A 3TW, England Google DeepMind, London EC4A 3TW, England

King, Helen
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, London EC4A 3TW, England Google DeepMind, London EC4A 3TW, England

Kumaran, Dharshan
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, London EC4A 3TW, England Google DeepMind, London EC4A 3TW, England

Wierstra, Daan
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, London EC4A 3TW, England Google DeepMind, London EC4A 3TW, England

Legg, Shane
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, London EC4A 3TW, England Google DeepMind, London EC4A 3TW, England

Hassabis, Demis
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, London EC4A 3TW, England Google DeepMind, London EC4A 3TW, England
[4]
A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play
[J].
Silver, David
;
Hubert, Thomas
;
Schrittwieser, Julian
;
Antonoglou, Ioannis
;
Lai, Matthew
;
Guez, Arthur
;
Lanctot, Marc
;
Sifre, Laurent
;
Kumaran, Dharshan
;
Graepel, Thore
;
Lillicrap, Timothy
;
Simonyan, Karen
;
Hassabis, Demis
.
SCIENCE,
2018, 362 (6419)
:1140-+

Silver, David
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 6 Pancras Sq, London N1C 4AG, England
UCL, Gower St, London WC1E 6BT, England DeepMind, 6 Pancras Sq, London N1C 4AG, England

Hubert, Thomas
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 6 Pancras Sq, London N1C 4AG, England DeepMind, 6 Pancras Sq, London N1C 4AG, England

Schrittwieser, Julian
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 6 Pancras Sq, London N1C 4AG, England DeepMind, 6 Pancras Sq, London N1C 4AG, England

Antonoglou, Ioannis
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 6 Pancras Sq, London N1C 4AG, England DeepMind, 6 Pancras Sq, London N1C 4AG, England

Lai, Matthew
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 6 Pancras Sq, London N1C 4AG, England DeepMind, 6 Pancras Sq, London N1C 4AG, England

Guez, Arthur
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 6 Pancras Sq, London N1C 4AG, England DeepMind, 6 Pancras Sq, London N1C 4AG, England

Lanctot, Marc
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 6 Pancras Sq, London N1C 4AG, England DeepMind, 6 Pancras Sq, London N1C 4AG, England

Sifre, Laurent
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 6 Pancras Sq, London N1C 4AG, England DeepMind, 6 Pancras Sq, London N1C 4AG, England

Kumaran, Dharshan
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 6 Pancras Sq, London N1C 4AG, England DeepMind, 6 Pancras Sq, London N1C 4AG, England

Graepel, Thore
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 6 Pancras Sq, London N1C 4AG, England DeepMind, 6 Pancras Sq, London N1C 4AG, England

Lillicrap, Timothy
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 6 Pancras Sq, London N1C 4AG, England DeepMind, 6 Pancras Sq, London N1C 4AG, England

Simonyan, Karen
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 6 Pancras Sq, London N1C 4AG, England DeepMind, 6 Pancras Sq, London N1C 4AG, England

Hassabis, Demis
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 6 Pancras Sq, London N1C 4AG, England DeepMind, 6 Pancras Sq, London N1C 4AG, England
[5]
Mastering the game of Go without human knowledge
[J].
Silver, David
;
Schrittwieser, Julian
;
Simonyan, Karen
;
Antonoglou, Ioannis
;
Huang, Aja
;
Guez, Arthur
;
Hubert, Thomas
;
Baker, Lucas
;
Lai, Matthew
;
Bolton, Adrian
;
Chen, Yutian
;
Lillicrap, Timothy
;
Hui, Fan
;
Sifre, Laurent
;
van den Driessche, George
;
Graepel, Thore
;
Hassabis, Demis
.
NATURE,
2017, 550 (7676)
:354-+

Silver, David
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 5 New St Sq, London EC4A 3TW, England DeepMind, 5 New St Sq, London EC4A 3TW, England

Schrittwieser, Julian
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 5 New St Sq, London EC4A 3TW, England DeepMind, 5 New St Sq, London EC4A 3TW, England

Simonyan, Karen
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 5 New St Sq, London EC4A 3TW, England DeepMind, 5 New St Sq, London EC4A 3TW, England

Antonoglou, Ioannis
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 5 New St Sq, London EC4A 3TW, England DeepMind, 5 New St Sq, London EC4A 3TW, England

Huang, Aja
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 5 New St Sq, London EC4A 3TW, England DeepMind, 5 New St Sq, London EC4A 3TW, England

Guez, Arthur
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 5 New St Sq, London EC4A 3TW, England DeepMind, 5 New St Sq, London EC4A 3TW, England

Hubert, Thomas
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 5 New St Sq, London EC4A 3TW, England DeepMind, 5 New St Sq, London EC4A 3TW, England

Baker, Lucas
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 5 New St Sq, London EC4A 3TW, England DeepMind, 5 New St Sq, London EC4A 3TW, England

Lai, Matthew
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 5 New St Sq, London EC4A 3TW, England DeepMind, 5 New St Sq, London EC4A 3TW, England

Bolton, Adrian
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 5 New St Sq, London EC4A 3TW, England DeepMind, 5 New St Sq, London EC4A 3TW, England

Chen, Yutian
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 5 New St Sq, London EC4A 3TW, England DeepMind, 5 New St Sq, London EC4A 3TW, England

Lillicrap, Timothy
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 5 New St Sq, London EC4A 3TW, England DeepMind, 5 New St Sq, London EC4A 3TW, England

Hui, Fan
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 5 New St Sq, London EC4A 3TW, England DeepMind, 5 New St Sq, London EC4A 3TW, England

Sifre, Laurent
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 5 New St Sq, London EC4A 3TW, England DeepMind, 5 New St Sq, London EC4A 3TW, England

van den Driessche, George
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 5 New St Sq, London EC4A 3TW, England DeepMind, 5 New St Sq, London EC4A 3TW, England

Graepel, Thore
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 5 New St Sq, London EC4A 3TW, England DeepMind, 5 New St Sq, London EC4A 3TW, England

Hassabis, Demis
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, 5 New St Sq, London EC4A 3TW, England DeepMind, 5 New St Sq, London EC4A 3TW, England
[6]
Mastering the game of Go with deep neural networks and tree search
[J].
Silver, David
;
Huang, Aja
;
Maddison, Chris J.
;
Guez, Arthur
;
Sifre, Laurent
;
van den Driessche, George
;
Schrittwieser, Julian
;
Antonoglou, Ioannis
;
Panneershelvam, Veda
;
Lanctot, Marc
;
Dieleman, Sander
;
Grewe, Dominik
;
Nham, John
;
Kalchbrenner, Nal
;
Sutskever, Ilya
;
Lillicrap, Timothy
;
Leach, Madeleine
;
Kavukcuoglu, Koray
;
Graepel, Thore
;
Hassabis, Demis
.
NATURE,
2016, 529 (7587)
:484-+

Silver, David
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, England

Huang, Aja
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, England

Maddison, Chris J.
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, England

Guez, Arthur
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, England

Sifre, Laurent
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, England

van den Driessche, George
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, England

Schrittwieser, Julian
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, England

Antonoglou, Ioannis
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, England

Panneershelvam, Veda
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, England

Lanctot, Marc
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, England

Dieleman, Sander
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, England

Grewe, Dominik
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, England

Nham, John
论文数: 0 引用数: 0
h-index: 0
机构:
Google, 1600 Amphitheatre Pkwy, Mountain View, CA 94043 USA Google DeepMind, 5 New St Sq, London EC4A 3TW, England

Kalchbrenner, Nal
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, England

Sutskever, Ilya
论文数: 0 引用数: 0
h-index: 0
机构:
Google, 1600 Amphitheatre Pkwy, Mountain View, CA 94043 USA Google DeepMind, 5 New St Sq, London EC4A 3TW, England

Lillicrap, Timothy
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, England

Leach, Madeleine
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, England

Kavukcuoglu, Koray
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, England

Graepel, Thore
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, England

Hassabis, Demis
论文数: 0 引用数: 0
h-index: 0
机构:
Google DeepMind, 5 New St Sq, London EC4A 3TW, England Google DeepMind, 5 New St Sq, London EC4A 3TW, England
[7]
Scalable Multi-Agent Computational Guidance with Separation Assurance for Autonomous Urban Air Mobility
[J].
Yang, Xuxi
;
Wei, Peng
.
JOURNAL OF GUIDANCE CONTROL AND DYNAMICS,
2020, 43 (08)
:1473-1486

Yang, Xuxi
论文数: 0 引用数: 0
h-index: 0
机构:
Iowa State Univ, Dept Aerosp Engn, Ames, IA 50011 USA Iowa State Univ, Dept Aerosp Engn, Ames, IA 50011 USA

Wei, Peng
论文数: 0 引用数: 0
h-index: 0
机构:
Iowa State Univ, Dept Aerosp Engn, Ames, IA 50011 USA Iowa State Univ, Dept Aerosp Engn, Ames, IA 50011 USA