On the Misleading Use of QF32 for QSAR Model Comparison

被引:33
作者
Consonni, Viviana [1 ]
Todeschini, Roberto [1 ]
Ballabio, Davide [1 ]
Grisoni, Francesca [1 ]
机构
[1] Univ Milano Bicocca, Dept Earth & Environm Sci, Piazza Sci 1, I-20126 Milan, Italy
关键词
QSAR; external validation; model comparison; Q(2)-like metrics; EXTERNAL VALIDATION; PREDICTION; ERROR; SYSTEM;
D O I
10.1002/minf.201800029
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Quantitative Structure - Activity Relationship (QSAR) models play a central role in medicinal chemistry, toxicology and computer-assisted molecular design, as well as a support for regulatory decisions and animal testing reduction. Thus, assessing their predictive ability becomes an essential step for any prospective application. Many metrics have been proposed to estimate the model predictive ability of QSARs, which have created confusion on how models should be evaluated and properly compared. Recently, we showed that the metric QF32 is particularly well-suited for comparing the external predictivity of different models developed on the same training dataset. However, when comparing models developed on different training data, this function becomes inadequate and only dispersion measures like the root-mean-square error (RMSE) should be used. The intent of this work is to provide clarity on the correct and incorrect uses of QF32, discussing its behavior towards the training data distribution and illustrating some cases in which QF32 estimates may be misleading. Hereby, we encourage the usage of measures of dispersions when models trained on different datasets have to be compared and evaluated.
引用
收藏
页数:4
相关论文
共 22 条
[1]   Beware of R2: Simple, Unambiguous Assessment of the Prediction Accuracy of QSAR and QSPR Models [J].
Alexander, D. L. J. ;
Tropsha, A. ;
Winkler, David A. .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2015, 55 (07) :1316-1322
[2]   Reliable estimation of prediction errors for QSAR models under model uncertainty using double cross-validation [J].
Baumann, Desiree ;
Baumann, Knut .
JOURNAL OF CHEMINFORMATICS, 2014, 6
[3]  
Benfenati E., 2007, Quantitative Structure-Activity Relationships (QSAR) for Pesticide Regulatory Purposes
[4]   Validation and extension of a similarity-based approach for prediction of acute aquatic toxicity towards Daphnia magna [J].
Cassotti, M. ;
Consonni, V. ;
Mauri, A. ;
Ballabio, D. .
SAR AND QSAR IN ENVIRONMENTAL RESEARCH, 2014, 25 (12) :1013-1036
[5]   QSAR Modeling: Where Have You Been? Where Are You Going To? [J].
Cherkasov, Artem ;
Muratov, Eugene N. ;
Fourches, Denis ;
Varnek, Alexandre ;
Baskin, Igor I. ;
Cronin, Mark ;
Dearden, John ;
Gramatica, Paola ;
Martin, Yvonne C. ;
Todeschini, Roberto ;
Consonni, Viviana ;
Kuz'min, Victor E. ;
Cramer, Richard ;
Benigni, Romualdo ;
Yang, Chihae ;
Rathman, James ;
Terfloth, Lothar ;
Gasteiger, Johann ;
Richard, Ann ;
Tropsha, Alexander .
JOURNAL OF MEDICINAL CHEMISTRY, 2014, 57 (12) :4977-5010
[6]   Real External Predictivity of QSAR Models: How To Evaluate It? Comparison of Different Validation Criteria and Proposal of Using the Concordance Correlation Coefficient [J].
Chirico, Nicola ;
Gramatica, Paola .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2011, 51 (09) :2320-2335
[7]   Evaluation of model predictive ability by external validation techniques [J].
Consonni, Viviana ;
Ballabio, Davide ;
Todeschini, Roberto .
JOURNAL OF CHEMOMETRICS, 2010, 24 (3-4) :194-201
[8]   Comments on the Definition of the Q2 Parameter for QSAR Validation [J].
Consonni, Viviana ;
Ballabio, Davide ;
Todeschini, Roberto .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2009, 49 (07) :1669-1678
[9]   A generalizable definition of chemical similarity for read-across [J].
Floris, Matteo ;
Manganaro, Alberto ;
Nicolotti, Orazio ;
Medda, Ricardo ;
Mangiatordi, Giuseppe Felice ;
Benfenati, Emilio .
JOURNAL OF CHEMINFORMATICS, 2014, 6
[10]   A Historical Excursus on the Statistical Validation Parameters for QSAR Models: A Clarification Concerning Metrics and Terminology [J].
Gramatica, Paola ;
Sangion, Alessandro .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2016, 56 (06) :1127-1131