Quantum Reinforcement Learning Applied to Board Games

Cited by: 0

Authors:

Teixeira, Miguel [1]

Rocha, Ana Paula [2]

Castro, Antonio J. M. [3]

Affiliations:

[1] Univ Porto, Dept Informat Engn DEI, Fac Engn FEUP, Porto, Portugal

[2] Univ Porto, Fac Engn FEUP, Dept Informat Engn DEI, Artificial Intelligence & Comp Sci Lab LIACC, Porto, Portugal

[3] Univ Porto, Artificial Intelligence & Comp Sci Lab LIACC, Porto, Portugal

Source:

2021 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY (WI-IAT 2021) | 2021

Keywords:

reinforcement learning; quantum computing; board games

DOI:

10.1145/3486622.3493944

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Reinforcement learning is a machine learning paradigm where an agent learns how to optimize its behavior solely through its interaction with the environment. It has been extensively studied and successfully applied to complex problems of many different domains in the past decades, e.g., robotics, games, scheduling. However, the performance of these algorithms becomes limited as the complexity and dimension of the state-action space increase. Recent advances in quantum computing and quantum information have sparked interest in possible applications to machine learning. By taking advantage of quantum mechanics, it is possible to efficiently process immense quantities of information and improve computational speed. In this work, we combined quantum computing with reinforcement learning and studied its application to a board game to assess the benefits that it can introduce, namely its impact on the learning efficiency of an agent. From the results, we concluded that the proposed quantum exploration policy improved the convergence rate of the agent and promoted a more efficient exploration of the state space.

Pages: 343-350

Page count: 8

