Quantitative structure-activity relationship (QSAR) study of carcinogenicity of polycyclic aromatic hydrocarbons (PAHs) in atmospheric particulate matter by random forest (RF)

被引:17
作者
Li, Nan [1 ]
Qi, Juan [1 ]
Wang, Ping [1 ]
Zhang, Xin [1 ]
Zhang, Tianlong [1 ]
Li, Hua [1 ,2 ]
机构
[1] Northwest Univ, Coll Chem & Mat Sci, Minist Educ, Key Lab Synthet & Nat Funct Mol Chem, Xian 710127, Shaanxi, Peoples R China
[2] Xian Shiyou Univ, Coll Chem & Chem Engn, Xian 710065, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
INDUCED BREAKDOWN SPECTROSCOPY; CLASSIFICATION; LIBS; ELEMENTS; STEEL; DERIVATIVES; PREDICTION; TOXICITY; EXPOSURE; SAMPLES;
D O I
10.1039/c8ay02720j
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
The carcinogenicity or toxicity information of a substance can be quickly and easily obtained by using a quantitative structure-activity relationship (QSAR) model. In this study, the carcinogenicity of PAHs was analyzed and predicted by using a random forest (RF) model with the molecular structure information and carcinogenicity data of PAHs. The molecular structure information of 91 PAHs was represented by molecular descriptors (such as structure descriptors, topology descriptors, molecular connectivity index and geometric descriptors) which were calculated by using Dragon5.4 software. The model parameters (ntree and mtry) and input variables were optimized and evaluated with respect to the accuracy, positive predictive value (PPV), negative predictive value (NPV) and out-of-bag (OOB) error. Then, based on the optimized model parameters and input variables, the RF, partial least squares-discriminant analysis (PLSDA) and artificial neural network (ANN) models were constructed to predict the carcinogenicity of PAHs. The results show that the classification accuracy, PPV, NPV and modeling time are 0.9333, 0.8889, 1.0000 and 10.40 s for the RF model, respectively, which shows a better predictive ability than the PLSDA and ANN models for the prediction of the carcinogenicity of PAHs. Therefore, it is demonstrated that RF are a very promising method for the accurate prediction of the carcinogenicity of PAHs.
引用
收藏
页码:1816 / 1821
页数:6
相关论文
共 43 条
[1]   Polycyclic aromatic hydrocarbons (PAHs) in the settled dust of automobile workshops, health and carcinogenic risk evaluation [J].
Ali, Nadeem ;
Ismail, Iqbal Mohammad Ibrahim ;
Khoder, Mamdouh ;
Shamy, Magdy ;
Alghamdi, Mansour ;
Al Khalaf, Abdulrahman ;
Costa, Max .
SCIENCE OF THE TOTAL ENVIRONMENT, 2017, 601 :478-484
[2]   2D-QSAR study of fullerene nanostructure derivatives as potent HIV-1 protease inhibitors [J].
Barzegar, Abolfazl ;
Mousavi, Somaye Jafari ;
Hamidi, Hossein ;
Sadeghi, Mehdi .
PHYSICA E-LOW-DIMENSIONAL SYSTEMS & NANOSTRUCTURES, 2017, 93 :324-331
[3]   3D-QSAR (CoMFA, CoMFA-RG, CoMSIA) and molecular docking study of thienopyrimidine and thienopyridine derivatives to explore structural requirements for aurora-B kinase inhibition [J].
Borisa, Ankit ;
Bhatt, Hardik .
EUROPEAN JOURNAL OF PHARMACEUTICAL SCIENCES, 2015, 79 :1-12
[4]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[5]  
Cao J. J., 2012, EARTH ENV, V3, P1030
[6]   Identifying relevant molecular descriptors related to carcinogenic activity of Polycyclic Aromatic Hydrocarbons (PAHs) using pattern recognition methods [J].
Coluci, VR ;
Vendrame, R ;
Braga, RS ;
Galvao, DS .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2002, 42 (06) :1479-1489
[7]   Breast Imaging Reporting and Data System (BI-RADS) breast composition descriptors: Automated measurement development for full field digital mammography [J].
Fowler, E. E. ;
Sellers, T. A. ;
Lu, B. ;
Heine, J. J. .
MEDICAL PHYSICS, 2013, 40 (11)
[8]   QSAR Modeling of ToxCast Assays Relevant to the Molecular Initiating Events of AOPs Leading to Hepatic Steatosis [J].
Gadaleta, Domenico ;
Manganelli, Serena ;
Roncaglioni, Alessandra ;
Toma, Cosimo ;
Benfenati, Emilio ;
Mombelli, Enrico .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2018, 58 (08) :1501-1517
[9]   IN VIVO TOXICITY OF NITROAROMATICS: A COMPREHENSIVE QUANTITATIVE STRUCTURE-ACTIVITY RELATIONSHIP STUDY [J].
Gooch, Aminah ;
Sizochenko, Natalia ;
Rasulev, Bakhtiyor ;
Gorb, Leonid ;
Leszczynski, Jerzy .
ENVIRONMENTAL TOXICOLOGY AND CHEMISTRY, 2017, 36 (08) :2227-2233
[10]   Application of random forests method to predict the retention indices of some polycyclic aromatic hydrocarbons [J].
Goudarzi, N. ;
Shahsavani, D. ;
Emadi-Gandaghi, F. ;
Chamjangali, M. Arab .
JOURNAL OF CHROMATOGRAPHY A, 2014, 1333 :25-31