Comprehensive strategies of machine-learning-based quantitative structure-activity relationship models

被引:80
作者
Mao, Jiashun [1 ,2 ,3 ]
Akhtar, Javed [2 ,4 ]
Zhang, Xiao [5 ]
Sun, Liang [6 ]
Guan, Shenghui [2 ,3 ]
Li, Xinyu [7 ,8 ]
Chen, Guangming [2 ,4 ]
Liu, Jiaxin [9 ]
Jeon, Hyeon-Nae [9 ]
Kim, Min Sung [9 ]
No, Kyoung Tai [1 ]
Wang, Guanyu [2 ,3 ,4 ]
机构
[1] Yonsei Univ, Interdisciplinary Grad Program Integrat Biotechno, Incheon 21983, South Korea
[2] Southern Univ Sci & Technol, Sch Life Sci, Dept Biol, 1088 Xueyuan Ave, Shenzhen 518055, Guangdong, Peoples R China
[3] Guangdong Prov Key Lab Computat Sci & Mat Design, Shenzhen 518055, Guangdong, Peoples R China
[4] Guangdong Prov Key Lab Cell Microenvironm & Dis R, Shenzhen 518055, Guangdong, Peoples R China
[5] Shanghai Rural Commercial Bank Co Ltd, Shanghai 200002, Peoples R China
[6] City Univ Hong Kong, Dept Phys, Kowloon, 83 Tat Chee Ave, Hong Kong, Peoples R China
[7] Chinese Univ Hong Kong, Sch Life & Hlth Sci, Shenzhen 518172, Peoples R China
[8] Chinese Univ Hong Kong, Warshel Inst Computat Biol, Shenzhen 518172, Peoples R China
[9] Yonsei Univ, Coll Life Sci & Biotechnol, Biotechnol, Seoul 03722, South Korea
基金
中国国家自然科学基金;
关键词
QUANTUM-MECHANICAL SIMULATION; DRUG DISCOVERY; MOLECULAR DESCRIPTOR; SCORING FUNCTION; NEURAL-NETWORKS; CLASSIFICATION MODELS; CHEMICAL DESCRIPTORS; BINDING AFFINITIES; KINASE INHIBITORS; SEARCH STRATEGIES;
D O I
10.1016/j.isci.2021.103052
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Early quantitative structure-activity relationship (QSAR) technologies have unsatisfactory versatility and accuracy in fields such as drug discovery because they are based on traditional machine learning and interpretive expert features. The development of Big Data and deep learning technologies significantly improve the processing of unstructured data and unleash the great potential of QSAR. Here we discuss the integration of wet experiments (which provide experimental data and reliable verification), molecular dynamics simulation (which provides mechanistic interpretation at the atomic/molecular levels), and machine learning (including deep learning) techniques to improve QSAR models. We first review the history of traditional QSAR and point out its problems. We then propose a better QSAR model characterized by a new iterative framework to integrate machine learning with disparate data input. Finally, we discuss the application of QSAR and machine learning to many practical research fields, including drug development and clinical trials.
引用
收藏
页数:20
相关论文
共 280 条
[1]   CRYO-ELECTRON MICROSCOPY OF VIRUSES [J].
ADRIAN, M ;
DUBOCHET, J ;
LEPAULT, J ;
MCDOWALL, AW .
NATURE, 1984, 308 (5954) :32-36
[2]   Deep Learning Accurately Predicts Estrogen Receptor Status in Breast Cancer Metabolomics Data [J].
Alakwaa, Fadhl M. ;
Chaudhary, Kumardeep ;
Garmire, Lana X. .
JOURNAL OF PROTEOME RESEARCH, 2018, 17 (01) :337-347
[3]   3D-QSAR studies on Maslinic acid analogs for Anticancer activity against Breast Cancer cell line MCF-7 [J].
Alam, Sarfaraz ;
Khan, Feroz .
SCIENTIFIC REPORTS, 2017, 7
[4]  
Alaya MZ, 2019, J MACH LEARN RES, V20
[5]   Low Data Drug Discovery with One-Shot Learning [J].
Altae-Tran, Han ;
Ramsundar, Bharath ;
Pappu, Aneesh S. ;
Pande, Vijay .
ACS CENTRAL SCIENCE, 2017, 3 (04) :283-293
[6]  
Altszyler E, 2018, REPRESENTATION LEARNING FOR NLP, P1
[7]  
Alves VM, 2016, GREEN CHEM, V18, P4348, DOI [10.1039/C6GC01492E, 10.1039/c6gc01492e]
[8]   The process of structure-based drug design [J].
Anderson, AC .
CHEMISTRY & BIOLOGY, 2003, 10 (09) :787-797
[9]   Direct Monte Carlo simulation of chemical reaction systems: Prediction of ultrafast detonations [J].
Anderson, JB ;
Long, LN .
JOURNAL OF CHEMICAL PHYSICS, 2003, 118 (07) :3102-3110
[10]   4D-QSAR: Perspectives in Drug Design [J].
Andrade, Carolina H. ;
Pasqualoto, Kerly F. M. ;
Ferreira, Elizabeth I. ;
Hopfinger, Anton J. .
MOLECULES, 2010, 15 (05) :3281-3294