Application of regression learning for gas chromatographic analysis and prediction of toxicity of organic molecules

被引:3
作者
Matyushin, D. D. [1 ]
Buryak, A. K. [1 ]
机构
[1] Russian Acad Sci, AN Frumkin Inst Phys Chem & Electrochem, Build 4,31 Leninsky Prosp, Moscow 119071, Russia
关键词
regression learning; gas chromatography; non-target analysis; toxicity; retention index; lethal dose; support vector regression; RETENTION INDEX; LIQUID-CHROMATOGRAPHY; LIBRARY SEARCH; ADSORPTION; SIMULATION;
D O I
10.1007/s11172-023-3811-2
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
An important task of physical chemistry is to predict the properties of chemical compounds from their structure. Prediction of the chromatographic retention allows one to reject false candidates in gas chromatography/mass spectrometry analysis and to elucidate structures of unknown compounds. Immediately after establishing the structure of an unknown analyte, the next task is to predict its properties, in particular, toxicity. In this work, the problem of prediction of gas chromatographic retention is considered in detail. A new method for predicting the retention indices on different stationary phases using regression learning is demonstrated in relation to flavors and fragrances. The achieved accuracy is higher than the accuracy of previously published methods. The median absolute error does not exceed 14 units. In addition, prediction of acute toxicity from the molecular structure is considered. The efficiency of various regression learning methods for predicting retention indices and acute toxicity (median lethal dose) of chemical compounds is compared.
引用
收藏
页码:482 / 492
页数:11
相关论文
共 49 条
[1]   An open source chemical structure curation pipeline using RDKit [J].
Bento, A. Patricia ;
Hersey, Anne ;
Felix, Eloy ;
Landrum, Greg ;
Gaulton, Anna ;
Atkinson, Francis ;
Bellis, Louisa J. ;
De Veij, Marleen ;
Leach, Andrew R. .
JOURNAL OF CHEMINFORMATICS, 2020, 12 (01)
[2]  
Buryak A. K., 2002, Russ. Chem. Rev, V71, P695, DOI [10.1070/RC2002v071n08ABEH000711, DOI 10.1070/RC2002V071N08ABEH000711]
[3]  
Buryak AK, 1996, RUSS CHEM B+, V45, P582
[4]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[5]   Reoptimization of MDL keys for use in drug discovery [J].
Durant, JL ;
Leland, BA ;
Henry, DR ;
Nourse, JG .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2002, 42 (06) :1273-1280
[6]   Quantitative structure-retention relationships XIV -: Prediction of gas chromatographic retention indices for saturated O-, N-, and S-heterocyclic compounds [J].
Farkas, O ;
Héberger, K ;
Zenkevich, IG .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2004, 72 (02) :173-184
[7]   Deep learning for retention time prediction in reversed-phase liquid chromatography [J].
Fedorova, Elizaveta S. ;
Matyushin, Dmitriy D. ;
Plyushchenko, Ivan, V ;
Stavrianidi, Andrey N. ;
Buryak, Aleksey K. .
JOURNAL OF CHROMATOGRAPHY A, 2022, 1664
[8]   An extensive experimental survey of regression methods [J].
Fernandez-Delgado, M. ;
Sirsat, M. S. ;
Cernadas, E. ;
Alawadi, S. ;
Barro, S. ;
Febrero-Bande, M. .
NEURAL NETWORKS, 2019, 111 :11-34
[9]   Toward realistic computer modeling of paraffin-based composite materials: critical assessment of atomic-scale models of paraffins [J].
Glova, Artyom D. ;
Volgin, Igor V. ;
Nazarychev, Victor M. ;
Larin, Sergey V. ;
Lyulin, Sergey V. ;
Gurtovenko, Andrey A. .
RSC ADVANCES, 2019, 9 (66) :38834-38847
[10]   Quantitative structure-(chromatographic) retention relationships [J].
Heberger, Karoly .
JOURNAL OF CHROMATOGRAPHY A, 2007, 1158 (1-2) :273-305