Nuclear spin-spin coupling constants prediction based on XGBoost and LightGBM algorithms

被引:15
作者
Zhang, Xin-xin [1 ]
Deng, Tong [1 ]
Jia, Guo-zhu [1 ]
机构
[1] Sichuan Normal Univ, Coll Phys & Elect Engn, Chengdu, Sichuan, Peoples R China
关键词
Chemoinformatics; molecular property; spin-spin coupling constant; XGBoost; LightGBM; AB-INITIO; SHIELDING CONSTANTS; DECISION TREE;
D O I
10.1080/00268976.2019.1696478
中图分类号
O64 [物理化学(理论化学)、化学物理学];
学科分类号
070304 ; 081704 ;
摘要
Nuclear magnetic resonance (NMR) is a robust method for the analysis of molecular complex structures, and the measurement of the nuclear spin-spin coupling constant is the key. In this paper, based on the 3D coordinates of the atoms in the molecule, the spin-spin coupling constants of atom-pairs are directly predicted using Extreme Gradient Boosting (XGBoost) and Light Gradient Boosting Machine (LightGBM). The calculated result of DFT method is taken as the target value. Experiment shows that LightGBM (R-2: 0.93) overall performance is better than XGBoost. In some molecules, the predicted fit (R-2) of the coupling constant between atoms even reached 1.00. This research avoids complex quantum mechanics and can assist in NMR to gain insight into the structure and dynamics of molecules, thereby enriching the data information analysis method of nuclear magnetic interaction.
引用
收藏
页数:10
相关论文
共 50 条
[1]  
[Anonymous], ery and Data Mining, DOI DOI 10.1145/2939672.2939785
[2]   NMR techniques for the investigation of solvation phenomena and non-covalent interactions [J].
Bagno, A ;
Rastrelli, F ;
Saielli, G .
PROGRESS IN NUCLEAR MAGNETIC RESONANCE SPECTROSCOPY, 2005, 47 (1-2) :41-93
[3]  
Ballester P.J, 2019, MACHINE LEARNING MOL
[4]   DFT calculation of NMR JFF spin-spin coupling constants in fluorinated pyridines [J].
Barone, V ;
Peralta, JE ;
Contreras, RH ;
Snyder, JP .
JOURNAL OF PHYSICAL CHEMISTRY A, 2002, 106 (23) :5607-5612
[5]   AB-INITIO STUDY OF THE NMR SHIELDING CONSTANTS AND SPIN-SPIN COUPLING-CONSTANTS IN CYCLOPROPENE [J].
BARSZCZEWICZ, A ;
JASZUNSKI, M ;
KAMIENSKATRELA, K ;
HELGAKER, T ;
JORGENSEN, P ;
VAHTRAS, O .
THEORETICA CHIMICA ACTA, 1993, 87 (1-2) :19-28
[6]  
Beltran J.C., 2019, 2019 IEEE C COMP INT, P1
[7]  
Chen R.-C., 2017, EICIENT COST AWARE C
[8]   Efficient Cost-Aware Cascade Ranking in Multi-Stage Retrieval [J].
Chen, Ruey-Cheng ;
Gallagher, Luke ;
Blanco, Roi ;
Culpepper, J. Shane .
SIGIR'17: PROCEEDINGS OF THE 40TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2017, :445-454
[9]  
Chen X., 2018, INT C BIOINSP COMP T, P307
[10]  
Chowdhury R., 2019, ARXIV190508444