Comparing classification models-a practical tutorial

Cited by: 6
Author
Walters, W. Patrick [1]
Affiliation
[1] Relay Therapeutics, 399 Binney St, Cambridge, MA 02141 USA
Keywords
QSAR; Classification model; Statistical validation; Machine learning; Tutorial;
DOI
10.1007/s10822-021-00417-2
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
While machine learning models have become a mainstay in cheminformatics, the field has yet to agree on standards for model evaluation and comparison. In many cases, authors compare methods by performing multiple folds of cross-validation and reporting the mean value of an evaluation metric such as the area under the receiver operating characteristic curve. These comparisons of mean values often lack statistical rigor and can lead to inaccurate conclusions. In the interest of encouraging best practices, this tutorial provides an example of how multiple methods can be compared in a statistically rigorous fashion.
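To illustrate why a bare comparison of mean values can mislead, here is a minimal sketch of one possible alternative: a paired sign-flip permutation test on per-fold AUC differences. This is not necessarily the procedure used in the tutorial itself; the function name `paired_sign_flip_test`, its parameters, and the AUC values are hypothetical and chosen only for illustration. Cross-validation folds share training data, so the resulting p-value should be read as approximate.

```python
import random
from statistics import mean


def paired_sign_flip_test(scores_a, scores_b, n_permutations=10_000, seed=0):
    """Two-sided paired permutation (sign-flip) test on per-fold differences.

    Hypothetical helper for illustration. Returns a tuple of
    (observed mean difference, approximate p-value).
    """
    if len(scores_a) != len(scores_b):
        raise ValueError("both methods must be scored on the same folds")

    # Pair the scores fold by fold; the pairing is what makes the test paired.
    diffs = [a - b for a, b in zip(scores_a, scores_b)]
    observed = mean(diffs)

    rng = random.Random(seed)
    extreme = 0
    for _ in range(n_permutations):
        # Under the null hypothesis the sign of each difference is arbitrary.
        flipped = [d if rng.random() < 0.5 else -d for d in diffs]
        # Small tolerance guards against floating-point ties.
        if abs(mean(flipped)) >= abs(observed) - 1e-12:
            extreme += 1

    # Add-one correction keeps the estimated p-value strictly positive.
    p_value = (extreme + 1) / (n_permutations + 1)
    return observed, p_value


# Illustrative per-fold AUC values (made up, not taken from the paper).
auc_method_a = [0.81, 0.79, 0.84, 0.80, 0.83, 0.78, 0.82, 0.85, 0.80, 0.81]
auc_method_b = [0.80, 0.80, 0.82, 0.79, 0.84, 0.77, 0.81, 0.83, 0.81, 0.80]

diff, p = paired_sign_flip_test(auc_method_a, auc_method_b)
print(f"mean AUC difference = {diff:.4f}, p = {p:.3f}")
```

Reporting the mean difference together with a p-value (or a confidence interval) shows whether an apparent gap between two methods is distinguishable from fold-to-fold noise, which the means alone cannot show.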
Pages: 381-389
Number of pages: 9