Real External Predictivity of QSAR Models: How To Evaluate It? Comparison of Different Validation Criteria and Proposal of Using the Concordance Correlation Coefficient

被引:632
作者
Chirico, Nicola [1 ]
Gramatica, Paola [1 ]
机构
[1] Univ Insubria, Dept Struct & Funct Biol, QSAR Res Grp Environm Chem & Ecotoxicol, I-21100 Varese, Italy
关键词
MEAN-SQUARE ERROR; VARIABLE-SELECTION; TRAINING SET; QSPR; TOXICITY; R(M)(2); SIZE;
D O I
10.1021/ci200211n
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
The main utility of QSAR models is their ability to predict activities/properties for new chemicals, and this external prediction ability is evaluated by means of various validation criteria. As a measure for such evaluation the OECD guidelines have proposed the predictive squared correlation coefficient Q(F1)(2) (Shi et al.). However, other validation criteria have been proposed by other authors: the Golbraikh-Tropsha method, r(m)(2), (Roy), Q(F2)(2) (Schuurmann et al.), Q(F3)(2) (Consonni et al.). In QSAR studies these measures are usually in accordance, though this is not always the case, thus doubts can arise when contradictory results are obtained. It is likely that none of the aforementioned criteria is the best in every situation, so a comparative study using simulated data sets is proposed here, using threshold values suggested by the proponents or those widely used in QSAR modeling. In addition, a different and simple external validation measure, the concordance correlation coefficient (CCC), is proposed and compared with other criteria. Huge data sets were used to study the general behavior of validation measures, and the concordance correlation coefficient was shown to be the most restrictive. On using simulated data sets of a more realistic size, it was found that CCC was broadly in agreement, about 96% of the time, with other validation measures in accepting models as predictive, and in almost all the examples it was the most precautionary. The proposed concordance correlation coefficient also works well on real data sets, where it seems to be more stable, and helps in making decisions when the validation measures are in conflict. Since it is conceptually simple, and given its stability and restrictiveness, we propose the concordance correlation coefficient as a complementary, or alternative, more prudent measure of a QSAR model to be externally predictive.
引用
收藏
页码:2320 / 2335
页数:16
相关论文
共 43 条
[1]  
[Anonymous], CHEMIDPLUS ADV
[2]   The better predictive model:: High q2 for the training set or low root mean square error of prediction for the test set? [J].
Aptula, AO ;
Jeliazkova, NG ;
Schultz, TW ;
Cronin, MTD .
QSAR & COMBINATORIAL SCIENCE, 2005, 24 (03) :385-396
[3]   Validation tools for variable subset regression [J].
Baumann, K ;
Stiefl, N .
JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2004, 18 (7-9) :549-562
[4]   Cross-validation as the objective function for variable-selection techniques [J].
Baumann, K .
TRAC-TRENDS IN ANALYTICAL CHEMISTRY, 2003, 22 (06) :395-406
[5]  
Benigni R., 2004, REPORT EXPERT GROUP, P84
[6]   Modelling physico-chemical properties of (benzo)triazoles, and screening for environmental partitioning [J].
Bhhatarai, B. ;
Gramatica, P. .
WATER RESEARCH, 2011, 45 (03) :1463-1471
[7]   Per- and Polyfluoro Toxicity (LC50 Inhalation) Study in Rat and Mouse Using QSAR Modeling [J].
Bhhatarai, Baron ;
Gramatica, Paola .
CHEMICAL RESEARCH IN TOXICOLOGY, 2010, 23 (03) :528-539
[8]   Oral LD50 toxicity modeling and prediction of per- and polyfluorinated chemicals on rat and mouse [J].
Bhhatarai, Barun ;
Gramatica, Paola .
MOLECULAR DIVERSITY, 2011, 15 (02) :467-476
[9]   CADASTER QSPR Models for Predictions of Melting and Boiling Points of Perfluorinated Chemicals [J].
Bhhatarai, Barun ;
Teetz, Wolfram ;
Liu, Tao ;
Oberg, Tomas ;
Jeliazkova, Nina ;
Kochev, Nikolay ;
Pukalov, Ognyan ;
Tetko, Igor V. ;
Kovarich, Simona ;
Papa, Ester ;
Gramatica, Paola .
MOLECULAR INFORMATICS, 2011, 30 (2-3) :189-204
[10]   Are Mechanistic and Statistical QSAR Approaches Really Different? MLR Studies on 158 Cycloalkyl-Pyranones [J].
Bhhatarai, Barun ;
Garg, Rajni ;
Gramatica, Paola .
MOLECULAR INFORMATICS, 2010, 29 (6-7) :511-522