Experimental Deep Reinforcement Learning for Error-Robust Gate-Set Design on a Superconducting Quantum Computer

被引：79

作者：

Baum, Yuval ^{[1
,2
]}

Amico, Mirko ^{[1
,2
]}

Howell, Sean ^{[1
,2
]}

Hush, Michael ^{[1
,2
]}

Liuzzi, Maggie ^{[1
,2
]}

Mundada, Pranav ^{[1
,2
]}

Merkh, Thomas ^{[1
,2
]}

Carvalho, Andre R. R. ^{[1
,2
]}

Biercuk, Michael J. ^{[1
,2
,3
]}

机构：

[1] Q CTRL, Sydney, NSW, Australia

[2] Q CTRL, Los Angeles, CA 90013 USA

[3] Univ Sydney, ARC Ctr Engn Quantum Syst, Sydney, NSW, Australia

来源：

PRX QUANTUM | 2021年 / 2卷 / 04期

关键词：

DECOHERENCE; ALGORITHM;

D O I：

10.1103/PRXQuantum.2.040324

中图分类号：

O4 [物理学];

学科分类号：

0702 ;

摘要：

Quantum computers promise tremendous impact across applications-and have shown great strides in hardware engineering-but remain notoriously error prone. Careful design of low-level controls has been shown to compensate for the processes that induce hardware errors, leveraging techniques from optimal and robust control. However, these techniques rely heavily on the availability of highly accurate and detailed physical models, which generally achieve only sufficient representative fidelity for the most simple operations and generic noise modes. In this work, we use deep reinforcement learning to design a universal set of error-robust quantum logic gates in runtime on a superconducting quantum computer, without requiring knowledge of a specific Hamiltonian model of the system, its controls, or its underlying error processes. We experimentally demonstrate that a fully autonomous deep-reinforcement-learning agent can design single qubit gates up to 3x faster than default DRAG operations without additional leakage error, and exhibiting robustness against calibration drifts over weeks. We then show that ZX (-pi/2) operations implemented using the cross-resonance interaction can outperform hardware default gates by over 2x and equivalently exhibit superior calibration-free performance up to 25 days post optimization. We benchmark the performance of deep-reinforcement-learning-derived gates against other black-box optimization techniques, showing that deep reinforcement learning can achieve comparable or marginally superior performance, even with limited hardware access.

引用

页数：12

共 68 条

[61] Leakage reduction in fast superconducting qubit gates via optimal control [J].