Illusory generalizability of clinical prediction models

被引:106
作者
Chekroud, Adam M. [1 ,2 ]
Hawrilenko, Matt [1 ]
Loho, Hieronimus [2 ]
Bondar, Julia [1 ]
Gueorguieva, Ralitza [3 ]
Hasan, Alkomiet [4 ]
Kambeitz, Joseph [5 ,6 ]
Corlett, Philip R. [2 ]
Koutsouleris, Nikolaos [7 ]
Krumholz, Harlan M. [8 ]
Krystal, John H. [2 ]
Paulus, Martin [9 ]
机构
[1] Spring Hlth, New York, NY 10010 USA
[2] Yale Univ, Dept Psychiat, Sch Med, New Haven, CT 06520 USA
[3] Yale Univ, Dept Biostat, New Haven, CT 06520 USA
[4] Univ Augsburg, Dept Psychiat Psychotherapy & Psychosomat, D-86159 Augsburg, Germany
[5] Univ Cologne, Fac Med, Dept Psychiat & Psychotherapy, Cologne, Germany
[6] Univ Hosp Cologne, Cologne, Germany
[7] Ludwig Maximilians Univ Munchen, Dept Psychiat & Psychotherapy, Munich, Germany
[8] Yale New Haven Hosp, Ctr Outcomes Res & Evaluat, New Haven, CT 06520 USA
[9] Laureate Inst Brain Res, Tulsa, OK 74136 USA
关键词
REGULARIZATION; SCHIZOPHRENIA; SELECTION; SCALE;
D O I
10.1126/science.adg8538
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
It is widely hoped that statistical models can improve decision-making related to medical treatments. Because of the cost and scarcity of medical outcomes data, this hope is typically based on investigators observing a model's success in one or two datasets or clinical contexts. We scrutinized this optimism by examining how well a machine learning model performed across several independent clinical trials of antipsychotic medication for schizophrenia. Models predicted patient outcomes with high accuracy within the trial in which the model was developed but performed no better than chance when applied out-of-sample. Pooling data across trials to predict outcomes in the trial left out did not improve predictions. These results suggest that models predicting treatment outcomes in schizophrenia are highly context-dependent and may have limited generalizability.
引用
收藏
页码:164 / 167
页数:4
相关论文
共 43 条
[1]  
Ahles TA, 2018, ANNU REV CLIN PSYCHO, V14, P425, DOI [10.1146/annurev-clinpsy-050817084903, 10.1146/annurev-clinpsy-050817-084903]
[2]  
Altman DG, 2000, STAT MED, V19, P453, DOI 10.1002/(SICI)1097-0258(20000229)19:4<453::AID-SIM350>3.3.CO
[3]  
2-X
[4]   Prognosis and prognostic research: validating a prognostic model [J].
Altman, Douglas G. ;
Vergouwe, Yvonne ;
Royston, Patrick ;
Moons, Karel G. M. .
BMJ-BRITISH MEDICAL JOURNAL, 2009, 338 :1432-1435
[5]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[6]   Clinical and Financial Outcomes Associated With a Workplace Mental Health Program Before and During the COVID-19 Pandemic [J].
Bondar, Julia ;
Morrow, Cecina Babich ;
Gueorguieva, Ralitza ;
Brown, Millard ;
Hawrilenko, Matt ;
Krystal, John H. ;
Corlett, Philip R. ;
Chekroud, Adam M. .
JAMA NETWORK OPEN, 2022, 5 (06) :E2216349
[7]  
Brodersen Kay H., 2010, Proceedings of the 2010 20th International Conference on Pattern Recognition (ICPR 2010), P3121, DOI 10.1109/ICPR.2010.764
[8]  
Busner Joan, 2007, Psychiatry (Edgmont), V4, P28
[9]   The perilous path from publication to practice [J].
Chekroud, A. M. ;
Koutsouleris, N. .
MOLECULAR PSYCHIATRY, 2018, 23 (01) :24-25
[10]  
Chekroud A. M., 2023, Code to Accompany Illusory Generalizability of Clinical Prediction Models