Predictiveness curves in virtual screening

Cited by: 76
Authors
Empereur-mot, Charly [1]
Guillemain, Helene [1]
Latouche, Aurelien [2]
Zagury, Jean-Francois [1]
Viallon, Vivian [3, 4, 5]
Montes, Matthieu [1]
Affiliations
[1] Conservatoire Natl Arts & Metiers, Lab Genom Bioinformat & Applicat, EA 4627, F-75003 Paris, France
[2] Conservatoire Natl Arts & Metiers, Equipe MSDMA, Lab CEDRIC, EA 4629, F-75003 Paris, France
[3] Univ Lyon, F-69622 Lyon, France
[4] Univ Lyon 1, UMRESTTE, F-69373 Lyon, France
[5] IFSTTAR, UMRESTTE, F-69675 Bron, France
Keywords
HIGH-THROUGHPUT DOCKING; CONTINUOUS MARKERS; PERFORMANCE;
DOI
10.1186/s13321-015-0100-8
Chinese Library Classification number
O6 [Chemistry];
Discipline classification code
0703;
Abstract
Background: In the present work, we aim to transfer the predictiveness curve, a metric advocated in clinical epidemiology, to the field of virtual screening. The literature describes the use of predictiveness curves to evaluate the performance of biological markers in formulating diagnoses and prognoses and in assessing disease risks, to assess the fit of risk models, and to estimate the clinical utility of a model when applied to a population. Similarly, we use logistic regression models to calculate activity probabilities from the scores that compounds obtain in virtual screening experiments. The predictiveness curve can provide an intuitive, graphical tool to compare the predictive power of virtual screening methods.
Results: Like ROC curves, predictiveness curves are functions of the distribution of the scores and provide a common scale for the evaluation of virtual screening methods. Unlike ROC curves, predictiveness curves describe the dispersion of the scores well. This property allows the predictive performance of virtual screening methods to be quantified on a fraction of a given molecular dataset and makes the predictiveness curve an efficient tool to address the early recognition problem. To this end, we introduce the total gain and partial total gain to quantify the recognition and early recognition of active compounds attributable to the variations of the scores obtained with virtual screening methods. In addition to their usefulness in evaluating virtual screening methods, predictiveness curves can be used to define optimal score thresholds for selecting compounds to be tested experimentally in a drug discovery program. We illustrate the use of predictiveness curves as a complement to ROC curves on the results of a virtual screening of the Directory of Useful Decoys datasets using three different methods (Surflex-dock, ICM, Autodock Vina).
Conclusion: The predictiveness curves cover different aspects of the predictive power of the scores, allowing a detailed evaluation of the performance of virtual screening methods. We believe predictiveness curves efficiently complete the set of tools available for the analysis of virtual screening results.
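The workflow summarized in the abstract (logistic regression on the scores, a predictiveness curve, then total gain and partial total gain) can be sketched roughly as follows. This is an illustrative sketch only, not the authors' implementation: the function names (`fit_logistic`, `predictiveness_curve`, `total_gain`), the toy scores and labels, and the unnormalized form of the partial total gain (the area between the curve and the prevalence line for quantiles above `alpha`) are all assumptions made here, and the normalization used in the paper may differ.

```python
import math

def sigmoid(t):
    # Numerically stable logistic function.
    if t >= 0:
        return 1.0 / (1.0 + math.exp(-t))
    e = math.exp(t)
    return e / (1.0 + e)

def fit_logistic(scores, labels, iters=50):
    """Fit P(active | score) = sigmoid(b0 + b1 * z) by Newton-Raphson,
    where z is the standardized score. Returns a function score -> risk."""
    n = len(scores)
    mean = sum(scores) / n
    sd = math.sqrt(sum((s - mean) ** 2 for s in scores) / n) or 1.0
    z = [(s - mean) / sd for s in scores]
    b0 = b1 = 0.0
    for _ in range(iters):
        g0 = g1 = h00 = h01 = h11 = 0.0
        for x, y in zip(z, labels):
            p = sigmoid(b0 + b1 * x)
            w = p * (1.0 - p)
            g0 += y - p            # gradient of the log-likelihood
            g1 += (y - p) * x
            h00 += w               # entries of X^T W X
            h01 += w * x
            h11 += w * x * x
        det = h00 * h11 - h01 * h01
        if det < 1e-12:
            break
        d0 = (h11 * g0 - h01 * g1) / det
        d1 = (h00 * g1 - h01 * g0) / det
        b0 += d0
        b1 += d1
        if abs(d0) + abs(d1) < 1e-10:
            break
    return lambda s: sigmoid(b0 + b1 * (s - mean) / sd)

def predictiveness_curve(scores, labels):
    """Return (v, R(v)) pairs: predicted activity probability R at quantile v."""
    model = fit_logistic(scores, labels)
    risks = sorted(model(s) for s in scores)
    n = len(risks)
    return [((i + 1) / n, r) for i, r in enumerate(risks)]

def total_gain(curve, prevalence, alpha=0.0):
    """Riemann-sum approximation of the area between R(v) and the prevalence
    line for v > alpha. alpha = 0 gives the total gain; alpha > 0 gives an
    unnormalized partial total gain (an assumption of this sketch)."""
    n = len(curve)
    return sum(abs(r - prevalence) for v, r in curve if v > alpha) / n

# Toy data invented for illustration: higher score = more likely active.
scores = [0.1, 0.4, 0.5, 0.9, 1.2, 1.5, 1.7, 2.0, 2.4, 2.8, 3.1, 3.5]
labels = [0, 0, 0, 0, 1, 0, 0, 1, 0, 1, 1, 1]

p = sum(labels) / len(labels)          # prevalence of actives
curve = predictiveness_curve(scores, labels)
tg = total_gain(curve, p)              # whole dataset
ptg = total_gain(curve, p, alpha=0.75) # top 25% of the ranked compounds
print(f"prevalence={p:.3f}  TG={tg:.3f}  pTG(0.75)={ptg:.3f}")
```

A flat curve at the prevalence line would give a total gain of zero (scores carry no information), while the steeper the curve, the larger the gain. Restricting the sum to high quantiles focuses the measure on the top-ranked compounds, which is the early recognition setting the abstract refers to.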
Pages: 17