40 entries in total
[31]
Mastering the game of Go without human knowledge [J]. Silver, David; Schrittwieser, Julian; Simonyan, Karen; Antonoglou, Ioannis; Huang, Aja; Guez, Arthur; Hubert, Thomas; Baker, Lucas; Lai, Matthew; Bolton, Adrian; Chen, Yutian; Lillicrap, Timothy; Hui, Fan; Sifre, Laurent; van den Driessche, George; Graepel, Thore; Hassabis, Demis. NATURE, 2017, 550 (7676): 354-+
[32]
Mastering the game of Go with deep neural networks and tree search [J]. Silver, David; Huang, Aja; Maddison, Chris J.; Guez, Arthur; Sifre, Laurent; van den Driessche, George; Schrittwieser, Julian; Antonoglou, Ioannis; Panneershelvam, Veda; Lanctot, Marc; Dieleman, Sander; Grewe, Dominik; Nham, John; Kalchbrenner, Nal; Sutskever, Ilya; Lillicrap, Timothy; Leach, Madeleine; Kavukcuoglu, Koray; Graepel, Thore; Hassabis, Demis. NATURE, 2016, 529 (7587): 484-+
[33]
Southey F., 2012, arXiv
[34]
Sutton RS, 2018, ADAPT COMPUT MACH LE, P1
[35]
Grandmaster level in StarCraft II using multi-agent reinforcement learning [J]. Vinyals, Oriol; Babuschkin, Igor; Czarnecki, Wojciech M.; Mathieu, Michael; Dudzik, Andrew; Chung, Junyoung; Choi, David H.; Powell, Richard; Ewalds, Timo; Georgiev, Petko; Oh, Junhyuk; Horgan, Dan; Kroiss, Manuel; Danihelka, Ivo; Huang, Aja; Sifre, Laurent; Cai, Trevor; Agapiou, John P.; Jaderberg, Max; Vezhnevets, Alexander S.; Leblond, Remi; Pohlen, Tobias; Dalibard, Valentin; Budden, David; Sulsky, Yury; Molloy, James; Paine, Tom L.; Gulcehre, Caglar; Wang, Ziyu; Pfaff, Tobias; Wu, Yuhuai; Ring, Roman; Yogatama, Dani; Wunsch, Dario; McKinney, Katrina; Smith, Oliver; Schaul, Tom; Lillicrap, Timothy; Kavukcuoglu, Koray; Hassabis, Demis; Apps, Chris; Silver, David. NATURE, 2019, 575 (7782): 350-+
[36]
Wortsman Mitchell, 2022, P MACHINE LEARNING R
[37]
L2E: Learning to Exploit Your Opponent [J]. Wu, Zhe; Li, Kai; Xu, Hang; Zang, Yifan; An, Bo; Xing, Junliang. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022
[38]
Yao J., 2024, Adv. Neural Inf. Process. Syst, V36, P67771
[39]
Ye Deheng, 2020, Advances in Neural Information Processing Systems, V33, P621
[40]
Zhou M., 2022, arXiv