Validation of clinical prediction models: what does the "calibration slope" really measure?

被引:80
作者
Stevens, Richard J. [1 ]
Poppe, Katrina K. [2 ]
机构
[1] Univ Oxford, Nuffield Dept Primary Care Hlth Sci, Oxford, England
[2] Univ Auckland, Fac Med & Hlth Sci, Auckland, New Zealand
关键词
Clinical prediction rule; Calibration; Validation; Discrimination; Spread; Slope; LOGISTIC-REGRESSION MODELS; PERFORMANCE;
D O I
10.1016/j.jclinepi.2019.09.016
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background and Objectives: Definitions of calibration, an aspect of model validation, have evolved over time. We examine use and interpretation of the statistic currently referred to as the calibration slope. Methods: The history of the term "calibration slope", and usage in papers published in 2016 and 2017, were reviewed. The behaviour of the slope in illustrative hypothetical examples and in two examples in the clinical literature was demonstrated. Results: The paper in which the statistic was proposed described it as a measure of "spread" and did not use the term "calibration". In illustrative examples, slope of 1 can be associated with good or bad calibration, and this holds true across different definitions of calibration. In data extracted from a previous study, the slope was correlated with discrimination, not overall calibration. Many authors of recent papers interpret the slope as a measure of calibration; a minority interpret it as a measure of discrimination or do not explicitly categorise it as either. Seventeen of thirty-three papers used the slope as the sole measure of calibration. Conclusion: Misunderstanding about this statistic has led to many papers in which it is the sole measure of calibration, which should be discouraged. (C) 2019 The Authors. Published by Elsevier Inc.
引用
收藏
页码:93 / 99
页数:7
相关论文
共 39 条
  • [31] Does the Five Facet Mindfulness Questionnaire Measure What We Think It Does? Construct Validity Evidence From an Active Controlled Randomized Clinical Trial
    Goldberg, Simon B.
    Wielgosz, Joseph
    Dahl, Cortland
    Schuyler, Brianna
    MacCoon, Donal S.
    Rosenkranz, Melissa
    Lutz, Antoine
    Sebranek, Chad A.
    Davidson, Richard J.
    PSYCHOLOGICAL ASSESSMENT, 2016, 28 (08) : 1009 - 1014
  • [32] Development and validation of clinical prediction models for acute kidney injury recovery at hospital discharge in critically ill adults
    Huang, Chao-Yuan
    Guiza, Fabian
    De Vlieger, Greet
    Wouters, Pieter
    Gunst, Jan
    Casaer, Michael
    Vanhorebeek, Ilse
    Derese, Inge
    Van den Berghe, Greet
    Meyfroidt, Geert
    JOURNAL OF CLINICAL MONITORING AND COMPUTING, 2023, 37 (01) : 113 - 125
  • [33] Development and validation of clinical prediction models for acute kidney injury recovery at hospital discharge in critically ill adults
    Chao-Yuan Huang
    Fabian Güiza
    Greet De Vlieger
    Pieter Wouters
    Jan Gunst
    Michael Casaer
    Ilse Vanhorebeek
    Inge Derese
    Greet Van den Berghe
    Geert Meyfroidt
    Journal of Clinical Monitoring and Computing, 2023, 37 : 113 - 125
  • [34] Using interpretability approaches to update "black-box" clinical prediction models: an external validation study in nephrology
    Cruz, Harry Freitas da
    Pfahringer, Boris
    Martensen, Tom
    Schneider, Frederic
    Meyer, Alexander
    Boettinger, Erwin
    Schapranow, Matthieu-P.
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2021, 111
  • [35] Targeted Development and Validation of Clinical Prediction Models in Secondary Care Settings: Opportunities and Challenges for Electronic Health Record Data
    van Maurik, I. S.
    Doodeman, H. J.
    Veeger-Nuijens, B. W.
    Mohringer, R. P. M.
    Sudion, D. R.
    Jongbloed, W.
    van Soelen, E.
    JMIR MEDICAL INFORMATICS, 2024, 12
  • [36] External validation of clinical prediction models: simulation-based sample size calculations were more reliable than rules-of-thumb
    Snell, Kym I. E.
    Archer, Lucinda
    Ensor, Joie
    Bonnett, Laura J.
    Debray, Thomas P. A.
    Phillips, Bob
    Collins, Gary S.
    Riley, Richard D.
    JOURNAL OF CLINICAL EPIDEMIOLOGY, 2021, 135 : 79 - 89
  • [37] External validation of clinical prediction models using big datasets from e-health records or IPD meta-analysis: opportunities and challenges
    Riley, Richard D.
    Ensor, Joie
    Snell, Kym I. E.
    Debray, Thomas P. A.
    Altman, Doug G.
    Moons, Karel G. M.
    Collins, Gary S.
    BMJ-BRITISH MEDICAL JOURNAL, 2016, 353
  • [38] The New Prostate Cancer Grading System Does Not Improve Prediction of Clinical Recurrence After Radical Prostatectomy: Results of a Large, Two-Center Validation Study
    Dell'Oglio, Paolo
    Karnes, Robert Jeffrey
    Gandaglia, Giorgio
    Fossati, Nicola
    Stabile, Armando
    Moschini, Marco
    Cucchiara, Vito
    Zaffuto, Emanuele
    Karakiewicz, Pierre I.
    Suardi, Nazareno
    Montorsi, Francesco
    Briganti, Alberto
    PROSTATE, 2017, 77 (03) : 263 - 273
  • [39] Does Adding Single-Nucleotide Polymorphisms to Risk Algorithms Improve Cardiovascular Disease Risk Prediction in Rheumatoid Arthritis? An Internal and External Validation of a Clinical Risk Score
    Agca, Rabia
    Popa, Calin D.
    Heymans, Martijn W.
    Crusius, Bart
    Voskuyl, Alexandre E.
    Nurmohamed, Michael T.
    ARTHRITIS CARE & RESEARCH, 2024, 76 (10) : 1419 - 1426