Artificially intelligent Maxwell's demon for optimal control of open quantum systems

被引:0
作者
Erdman, Paolo A. [1 ]
Czupryniak, Robert [2 ,3 ]
Bhandari, Bibek [3 ]
Jordan, Andrew N. [2 ,3 ,4 ]
Noe, Frank [1 ,5 ]
Eisert, Jens [6 ,7 ,8 ]
Guarnieri, Giacomo [6 ,9 ,10 ]
机构
[1] Free Univ Berlin, Dept Math & Comp Sci, Arnimallee 6, D-14195 Berlin, Germany
[2] Univ Rochester, Dept Phys & Astron, Rochester, NY 14627 USA
[3] Chapman Univ, Inst Quantum Studies, Orange, CA 92866 USA
[4] Chapman Univ, Kennedy Chair Phys, Orange, CA 92866 USA
[5] Microsoft Res AI4Sci, Karl Liebknecht Str 32, D-10178 Berlin, Germany
[6] Free Univ Berlin, Dahlem Ctr Complex Quantum Syst, D-14195 Berlin, Germany
[7] Free Univ Berlin, Dept Phys, Arnimallee 6, D-14195 Berlin, Germany
[8] Rice Univ, Dept Chem, Houston, TX 77005 USA
[9] Univ Pavia, Dept Phys, Sez Pavia, Via Bassi 6, I-27100 Pavia, Italy
[10] Univ Pavia, Sez Pavia, INFN, Via Bassi 6, I-27100 Pavia, Italy
来源
QUANTUM SCIENCE AND TECHNOLOGY | 2025年 / 10卷 / 02期
关键词
quantum thermodynamics; machine learning; optimal control theory; quantum feedback control; LANDAUER ERASURE; REALIZATION; INFORMATION; COLLOQUIUM; PRINCIPLE; PHYSICS;
D O I
10.1088/2058-9565/adbccf
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
Feedback control of open quantum systems is of fundamental importance for practical applications in various contexts, ranging from quantum computation to quantum error correction and quantum metrology. Its use in the context of thermodynamics further enables the study of the interplay between information and energy. However, deriving optimal feedback control strategies is highly challenging, as it involves the optimal control of open quantum systems, the stochastic nature of quantum measurement, and the inclusion of policies that maximize a long-term time- and trajectory-averaged goal. In this work, we employ a reinforcement learning approach to automate and capture the role of a quantum Maxwell's demon: the agent takes the literal role of discovering optimal feedback control strategies in qubit-based systems that maximize a trade-off between measurement-powered cooling and measurement efficiency. Considering weak or projective quantum measurements, we explore different regimes based on the ordering between the thermalization, the measurement, and the unitary feedback timescales, finding different and highly non-intuitive, yet interpretable, strategies. In the thermalization-dominated regime, we find strategies with elaborate finite-time thermalization protocols conditioned on measurement outcomes. In the measurement-dominated regime, we find that optimal strategies involve adaptively measuring different qubit observables reflecting the acquired information, and repeating multiple weak measurements until the quantum state is 'sufficiently pure', leading to random walks in state space. Finally, we study the case when all timescales are comparable, finding new feedback control strategies that considerably outperform more intuitive ones. We discuss a two-qubit example where we explore the role of entanglement and conclude discussing the scaling of our results to quantum many-body systems.
引用
收藏
页数:34
相关论文
共 163 条
  • [1] Thermally driven quantum refrigerator autonomously resets a superconducting qubit
    Aamir, Mohammed Ali
    Suria, Paul Jamet
    Guzman, Jose Antonio Marin
    Castillo-Moreno, Claudia
    Epstein, Jeffrey M.
    Halpern, Nicole Yunger
    Gasparinetti, Simone
    [J]. NATURE PHYSICS, 2025, 21 (02) : 318 - 323
  • [2] Single-Ion Heat Engine at Maximum Power
    Abah, O.
    Ronagel, J.
    Jacob, G.
    Deffner, S.
    Schmidt-Kaler, F.
    Singer, K.
    Lutz, E.
    [J]. PHYSICAL REVIEW LETTERS, 2012, 109 (20)
  • [3] Geometric Optimisation of Quantum Thermodynamic Processes
    Abiuso, Paolo
    Miller, Harry J. D.
    Perarnau-Llobet, Marti
    Scandi, Matteo
    [J]. ENTROPY, 2020, 22 (10) : 1 - 21
  • [4] Optimal Cycles for Low-Dissipation Heat Engines
    Abiuso, Paolo
    Perarnau-Llobet, Marti
    [J]. PHYSICAL REVIEW LETTERS, 2020, 124 (11)
  • [5] Non-Markov enhancement of maximum power for quantum thermal machines
    Abiuso, Paolo
    Giovannetti, Vittorio
    [J]. PHYSICAL REVIEW A, 2019, 99 (05)
  • [6] Achiam J., 2018, Spinning up in deep reinforcement learning
  • [7] Assessing the validity of the thermodynamic uncertainty relation in quantum systems
    Agarwalla, Bijay Kumar
    Segal, Dvira
    [J]. PHYSICAL REVIEW B, 2018, 98 (15)
  • [8] QUANTUM OPEN SYSTEM AS A MODEL OF THE HEAT ENGINE
    ALICKI, R
    [J]. JOURNAL OF PHYSICS A-MATHEMATICAL AND GENERAL, 1979, 12 (05): : L103 - L107
  • [9] Deep reinforcement learning for quantum gate control
    An, Zheng
    Zhou, D. L.
    [J]. EPL, 2019, 126 (06)
  • [10] Quantum Fokker-Planck Master Equation for Continuous Feedback Control
    Annby-Andersson, Bjorn
    Bakhshinezhad, Faraj
    Bhattacharyya, Debankur
    De Sousa, Guilherme
    Jarzynski, Christopher
    Samuelsson, Peter
    Potts, Patrick P.
    [J]. PHYSICAL REVIEW LETTERS, 2022, 129 (05)