Quantum Reinforcement Learning Applied to Board Games

被引:0
作者
Teixeira, Miguel [1 ]
Rocha, Ana Paula [2 ]
Castro, Antonio J. M. [3 ]
机构
[1] Univ Porto, Dept Informat Engn DEI, Fac Engn FEUP, Porto, Portugal
[2] Univ Porto, Fac Engn FEUP, Dept Informat Engn DEI, Artificial Intelligence & Comp Sci Lab LIACC, Porto, Portugal
[3] Univ Porto, Artificial Intelligence & Comp Sci Lab LIACC, Porto, Portugal
来源
2021 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY (WI-IAT 2021) | 2021年
关键词
reinforcement learning; quantum computing; board games;
D O I
10.1145/3486622.3493944
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reinforcement learning is a machine learning paradigm where an agent learns how to optimize its behavior solely through its interaction with the environment. It has been extensively studied and successfully applied to complex problems of many different domains in the past decades, i.e., robotics, games, scheduling. However, the performance of these algorithms becomes limited as the complexity and dimension of the state-action space increases. Recent advances in quantum computing and quantum information have sparked interest in possible applications to machine learning. By taking advantage of quantum mechanics, it is possible to efficiently process immense quantities of information and improve computational speed. In this work, we combined quantum computing with reinforcement learning and studied its application to a board game to assess the benefits that it can introduce, namely its impact on the learning efficiency of an agent. From the results, we concluded that the proposed quantum exploration policy improved the convergence rate of the agent and promoted a more efficient exploration of the state space.
引用
收藏
页码:343 / 350
页数:8
相关论文
共 13 条
  • [1] Measurement-based adaptation protocol with quantum reinforcement learning
    Albarran-Arriagada, F.
    Retamal, J. C.
    Solano, E.
    Lamata, L.
    [J]. PHYSICAL REVIEW A, 2018, 98 (04)
  • [2] DYNAMIC PROGRAMMING
    BELLMAN, R
    [J]. SCIENCE, 1966, 153 (3731) : 34 - &
  • [3] Quantum machine learning
    Biamonte, Jacob
    Wittek, Peter
    Pancotti, Nicola
    Rebentrost, Patrick
    Wiebe, Nathan
    Lloyd, Seth
    [J]. NATURE, 2017, 549 (7671) : 195 - 202
  • [4] Brassard G., 2002, Contemporary Mathematics, V305, P53, DOI DOI 10.1090/CONM/305/05215
  • [5] Projective simulation for artificial intelligence
    Briegel, Hans J.
    De las Cuevas, Gemma
    [J]. SCIENTIFIC REPORTS, 2012, 2
  • [6] Chen Samuel Yen-Chi, 2020, VARIATIONAL QUANTUM, V8, DOI [10.1109/ACCESS.IEEEAccess2020.3010470, DOI 10.1109/ACCESS.IEEEACCESS2020.3010470]
  • [7] Quantum Speedup for Active Learning Agents
    Davide Paparo, Giuseppe
    Dunjko, Vedran
    Makmal, Adi
    Angel Martin-Delgado, Miguel
    Briegel, Hans J.
    [J]. PHYSICAL REVIEW X, 2014, 4 (03):
  • [8] Quantum reinforcement learning
    Dong, Daoyi
    Chen, Chunlin
    Li, Hanxiong
    Tarn, Tzyh-Jong
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2008, 38 (05): : 1207 - 1220
  • [9] SEARCH VIA QUANTUM WALK
    Magniez, Frederic
    Nayak, Ashwin
    Roland, Jeremie
    Santha, Miklos
    [J]. SIAM JOURNAL ON COMPUTING, 2011, 40 (01) : 142 - 164
  • [10] Ng AY, 1999, MACHINE LEARNING, PROCEEDINGS, P278