Hyperparameter importance and optimization of quantum neural networks across small datasets

被引：0

作者：

Charles Moussa

Yash J. Patel

Vedran Dunjko

Thomas Bäck

Jan N. van Rijn

机构：

[1] LIACS,

[2] Leiden University,undefined

来源：

Machine Learning | 2024年 / 113卷

关键词：

Hyperparameter importance; Quantum neural networks; Quantum machine learning; Hyperparameter optimization.;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

As restricted quantum computers become available, research focuses on finding meaningful applications. For example, in quantum machine learning, a special type of quantum circuit called a quantum neural network is one of the most investigated approaches. However, we know little about suitable circuit architectures or important model hyperparameters for a given task. In this work, we apply the functional ANOVA framework to the quantum neural network architectures to analyze which of the quantum machine learning hyperparameters are most influential for their predictive performance. We restrict our study to 7 open-source datasets from the OpenML-CC18 classification benchmark, which are small enough for simulations on quantum hardware with fewer than 20 qubits. Using this framework, three main levels of importance were identified, confirming expected patterns and revealing new insights. For instance, the learning rate is identified as the most important hyperparameter on all datasets, whereas the particular choice of entangling gates used is found to be the least important on all except for one dataset. In addition to identifying the relevant hyperparameters, for each of them, we also learned data-driven priors based on values that perform well on previously seen datasets, which can then be used to steer hyperparameter optimization processes. We utilize these priors in the hyperparameter optimization method hyperband and show that these improve performance against uniform sampling across all datasets by, on average, 0.53%\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$0.53 \%$$\end{document}, up to 6.11%\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$6.11 \%$$\end{document}, in cross-validation accuracy. We also demonstrate that such improvements hold on average regardless of the configuration hyperband is run with. Our work introduces new methodologies for studying quantum machine learning models toward quantum model selection in practice. All research code is made publicly available.

引用

页码：1941 / 1966

页数：25

共 153 条

[1] Benedetti M(2019)Parameterized quantum circuits as machine learning models Quantum Science and Technology 4 5-32
[2] Lloyd E(2001)Random forests Machine Learning 45 582-61
[3] Sack S(2021)Encoding-dependent generalization bounds for parametrized quantum circuits Quantum 5 1791-185
[4] Fiorentini M(2021)Cost function dependent barren plateaus in shallow parametrized quantum circuits Nature communications 12 62-212
[5] Breiman L(2022)Quantum circuit architecture search for variational quantum algorithms Quantum Information 8 1-732
[6] Caro MC(2020)Auto-sklearn 2.0: Hands-free automl via meta-learning The Journal of Machine Learning Research 23 153-28375
[7] Gil-Fuster E(2014)Quantum simulation. Review of Modern Physics 86 209-246
[8] Meyer JJ(2023)Quantum machine learning of large datasets using randomized measurements Machine Learning: Science and Technology 4 709-6816
[9] Eisert J(2019)Supervised learning with quantum-enhanced feature spaces Nature 567 517-1017
[10] Sweke R(2007)Generalized functional anova diagnostics for high-dimensional functions of dependent variables Journal of Computational and Graphical Statistics 16 28362-6

← 1 2 3 4 5 6 7 8 9 10 →