Quantum generative adversarial imitation learning

被引：3

作者：

Xiao, Tailong ^{[1
,2
]}

Huang, Jingzheng ^{[1
,2
]}

Li, Hongjing ^{[1
,2
]}

Fan, Jianping ^{[3
]}

Zeng, Guihua ^{[1
,2
]}

机构：

[1] Shanghai Jiao Tong Univ, State Key Lab Adv Opt Commun Syst & Networks, Shanghai 200240, Peoples R China

[2] Shanghai Jiao Tong Univ, Ctr Quantum Sensing & Informat Proc, Shanghai 200240, Peoples R China

[3] Univ N Carolina, Dept Comp Sci, Charlotte, NC 28223 USA

来源：

NEW JOURNAL OF PHYSICS | 2023年 / 25卷 / 03期

基金：

中国国家自然科学基金;

关键词：

quantum machine learning; quantum sensing; quantum control;

D O I：

10.1088/1367-2630/acc605

中图分类号：

O4 [物理学];

学科分类号：

0702 ;

摘要：

Investigating quantum advantage in the NISQ era is a challenging problem whereas quantum machine learning becomes the most promising application that can be resorted to. However, no proposal has been investigated for arguably challenging inverse reinforcement learning to demonstrate the potential advantage. In this work, we propose a hybrid quantum-classical inverse reinforcement learning algorithm based on the variational quantum circuit with the generative adversarial framework. We find an important connection between the quantum gradient anomaly and the performance degradation, which suggest a gradient clipping strategy to stabilize the training process. In light of the algorithm, we study three classic control problems and the Hamiltonian parameter estimation in quantum sensing with shallow quantum circuits. The numerical results showcase that the control-enhanced quantum sensor can saturate quantum Cramer-Rao bound only with a single variational layer, empirically demonstrating a parameter complexity advantage over the classical learning control. The proposed generative adversarial reinforcement learning algorithm achieves state-of-the-art performance in classical and quantum sensor control in terms of required number of parameters.

引用

页数：23

共 50 条

[1] Generative Adversarial Imitation Learning
Ho, Jonathan
Ermon, Stefano
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
[2] Deterministic generative adversarial imitation learning
Zuo, Guoyu
Chen, Kexin
Lu, Jiahao
Huang, Xiangsheng
NEUROCOMPUTING, 2020, 388 : 60 - 69
[3] A Bayesian Approach to Generative Adversarial Imitation Learning
Jeon, Wonseok
Seo, Seokin
Kim, Kee-Eung
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
[4] Quantum Generative Adversarial Learning
Lloyd, Seth
Weedbrook, Christian
PHYSICAL REVIEW LETTERS, 2018, 121 (04)
[5] Robot Manipulation Learning Using Generative Adversarial Imitation Learning
Jabri, Mohamed Khalil
PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 4893 - 4894
[6] A Survey of Imitation Learning Based on Generative Adversarial Nets
Lin J.-H.
Zhang Z.-Z.
Jiang C.
Hao J.-Y.
Jisuanji Xuebao/Chinese Journal of Computers, 2020, 43 (02): : 326 - 351
[7] Ranking-Based Generative Adversarial Imitation Learning
Shi, Zhipeng
Zhang, Xuehe
Fang, Yu
Li, Changle
Liu, Gangfeng
Zhao, Jie
IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (10): : 8967 - 8974
[8] Generative Adversarial Imitation Learning from Failed Experiences
Zhu, Jiacheng
Lin, Jiahao
Wang, Meng
Chen, Yingfeng
Fan, Changjie
Jiang, Chong
Zhang, Zongzhang
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 13997 - 13998
[9] TextGAIL: Generative Adversarial Imitation Learning for Text Generation
Wu, Qingyang
Li, Lei
Yu, Zhou
THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 14067 - 14075
[10] Multimodal Storytelling via Generative Adversarial Imitation Learning
Chen, Zhiqian
Zhang, Xuchao
Boedihardjo, Arnold P.
Dai, Jing
Lu, Chang-Tien
PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3967 - 3973

← 1 2 3 4 5 →