Quantum Machine Learning Algorithms for Drug Discovery Applications

被引:77
作者
Batra, Kushal [1 ]
Zorn, Kimberley M. [2 ]
Foil, Daniel H. [2 ]
Minerali, Eni [2 ]
Gawriljuk, Victor O. [3 ]
Lane, Thomas R. [2 ]
Ekins, Sean [2 ]
机构
[1] North Carolina State Univ, Comp Sci, Raleigh, NC 27606 USA
[2] Collaborat Pharmaceut Inc, Raleigh, NC 27606 USA
[3] Univ Sao Paulo, Sao Carlos Inst Phys, BR-13563120 Sao Carlos, SP, Brazil
基金
美国国家卫生研究院; 巴西圣保罗研究基金会;
关键词
DATABASE;
D O I
10.1021/acs.jcim.1c00166
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
The growing quantity of public and private data sets focused on small molecules screened against biological targets or whole organisms provides a wealth of drug discovery relevant data. This is matched by the availability of machine learning algorithms such as Support Vector Machines (SVM) and Deep Neural Networks (DNN) that are computationally expensive to perform on very large data sets with thousands of molecular descriptors. Quantum computer (QC) algorithms have been proposed to offer an approach to accelerate quantum machine learning over classical computer (CC) algorithms, however with significant limitations. In the case of cheminformatics, which is widely used in drug discovery, one of the challenges to overcome is the need for compression of large numbers of molecular descriptors for use on a QC. Here, we show how to achieve compression with data sets using hundreds of molecules (SARS-CoV-2) to hundreds of thousands of molecules (whole cell screening data sets for plague and M. tuberculosis) with SVM and the data reuploading classifier (a DNN equivalent algorithm) on a QC benchmarked against CC and hybrid approaches. This study illustrates the steps needed in order to be "quantum computer ready" in order to apply quantum computing to drug discovery and to provide the foundation on which to build this field.
引用
收藏
页码:2641 / 2647
页数:7
相关论文
共 39 条
[1]  
[Anonymous], ADAM METHOD STOCHAST
[2]  
[Anonymous], 2019, Qiskit: An Open-source Framework for Quantum Computing
[3]   Large scale comparison of QSAR and conformal prediction methods and their applications in drug discovery [J].
Bosc, Nicolas ;
Atkinson, Francis ;
Felix, Eloy ;
Gaulton, Anna ;
Hersey, Anne ;
Leach, Andrew R. .
JOURNAL OF CHEMINFORMATICS, 2019, 11 (1)
[4]  
Broughton M., Tensorflow quantum: A software framework for quantum machine learning
[5]   Open Source Bayesian Models. 1. Application to ADME/Tox and Drug Discovery Datasets [J].
Clark, Alex M. ;
Dole, Krishna ;
Coulon-Spektor, Anna ;
McNutt, Andrew ;
Grass, George ;
Freundlich, Joel S. ;
Reynolds, Robert C. ;
Ekins, Sean .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2015, 55 (06) :1231-1245
[6]   Open Source Bayesian Models. 2. Mining a "Big Dataset" To Create and Validate Models with ChEMBL [J].
Clark, Alex M. ;
Ekins, Sean .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2015, 55 (06) :1246-1260
[7]   hERGCentral: A Large Database to Store, Retrieve, and Analyze Compound-Human Ether-a-go-go Related Gene Channel Interactions to Facilitate Cardiotoxicity Assessment in Drug Development [J].
Du, Fang ;
Yu, Haibo ;
Zou, Beiyan ;
Babcock, Joseph ;
Long, Shunyou ;
Li, Min .
ASSAY AND DRUG DEVELOPMENT TECHNOLOGIES, 2011, 9 (06) :580-588
[8]  
Ekins S, 2020, MACHINE LEARNING MOD
[9]   Exploiting machine learning for end-to-end drug discovery and development [J].
Ekins, Sean ;
Puhl, Ana C. ;
Zorn, Kimberley M. ;
Lane, Thomas R. ;
Russo, Daniel P. ;
Klein, Jennifer J. ;
Hickey, Anthony J. ;
Clark, Alex M. .
NATURE MATERIALS, 2019, 18 (05) :435-441
[10]   Efficacy of Tilorone Dihydrochloride against Ebola Virus Infection [J].
Ekins, Sean ;
Lingerfelt, Mary A. ;
Comer, Jason E. ;
Freiberg, Alexander N. ;
Mirsalis, Jon C. ;
O'Loughlin, Kathleen ;
Harutyunyan, Anush ;
McFarlane, Claire ;
Green, Carol E. ;
Madrid, Peter B. .
ANTIMICROBIAL AGENTS AND CHEMOTHERAPY, 2018, 62 (02)