Real External Predictivity of QSAR Models. Part 2. New Intercomparable Thresholds for Different Validation Criteria and the Need for Scatter Plot Inspection

被引:431
作者
Chirico, Nicola [1 ]
Gramatica, Paola [1 ]
机构
[1] Univ Insubria, QSAR Res Unit Environm Chem & Ecotoxicol, Dept Theoret & Appl Sci, I-21100 Varese, Italy
关键词
CONCORDANCE CORRELATION; CORRELATION-COEFFICIENT; EVALUATE; METRICS; SET;
D O I
10.1021/ci300084j
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
The evaluation of regression QSAR model performance, in fitting, robustness, and external prediction, is of pivotal importance. Over the past decade, different external validation parameters have been proposed: Q(F1)(2), Q(F2)(2), Q(F3)(2), <(r(m)(2))over bar>, and the Golbraikh-Tropsha method. Recently, the concordance correlation coefficient (CCC, Lin), which simply verifies how small the differences are between experimental data and external data set predictions, independently of their range, was proposed by our group as an external Validation parameter for use in QSAR studies. In our preliminary work, we demonstrated with thousands of simulated models that CCC is in good agreement with the Compared validation criteria (except r(m)(2)) using the cutoff values normally applied for the acceptance of QSAR models as externally predictive. In this new work, we have studied and compared the general trends of the various criteria relative to different possible biases (scale and location shifts) in external data distributions, using a wide range of different simulated Scenarios. This study, further supported by visual inspection of experimental vs predicted data scatter plots, has highlighted problems related to some criteria. Indeed, if based on the cutoff suggested by the proponent, <(r(m)(2))over bar> could also accept not predictive models in two of the possible biases (location, location plus scale), while in the case of scale shift bias, it appears to be the most restrictive. Moreover, Q(F1)(2) and Q(F2)(2) showed some problems in one of the possible biases (scale shift). This analysis allowed us to also propose recalibrated and intercomparable for the same data scatter, new thresholds for each criterion in defining a QSAR model as really externally predictive in a more precautionary approach An analysis of the results revealed that the scatter plot of experimental vs predicted external data must always be evaluated to support the statistical criteria values: in some cases high statistical parameter values could hide models with unacceptable predictions.
引用
收藏
页码:2044 / 2058
页数:15
相关论文
共 24 条
[21]   External Validation and Prediction Employing the Predictive Squared Correlation Coefficient - Test Set Activity Mean vs Training Set Activity Mean [J].
Schueuermann, Gerrit ;
Ebert, Ralf-Uwe ;
Chen, Jingwen ;
Wang, Bin ;
Kuehne, Ralph .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2008, 48 (11) :2140-2145
[22]   QSAR models using a large diverse set of estrogens [J].
Shi, LM ;
Fang, H ;
Tong, WD ;
Wu, J ;
Perkins, R ;
Blair, RM ;
Branham, WS ;
Dial, SL ;
Moland, CI ;
Sheehan, DM .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2001, 41 (01) :186-195
[23]   The importance of being earnest: Validation is the absolute essential for successful application and interpretation of QSPR models [J].
Tropsha, A ;
Gramatica, P ;
Gombar, VK .
QSAR & COMBINATORIAL SCIENCE, 2003, 22 (01) :69-77
[24]   Best Practices for QSAR Model Development, Validation, and Exploitation [J].
Tropsha, Alexander .
MOLECULAR INFORMATICS, 2010, 29 (6-7) :476-488