Replication of machine learning methods to predict treatment outcome with antidepressant medications in patients with major depressive disorder from STAR*D and CAN-BIND-1

被引:16
作者
Nunez, John-Jose [1 ,2 ]
Nguyen, Teyden T. [2 ]
Zhou, Yihan [2 ]
Cao, Bo [3 ]
Ng, Raymond T. [2 ]
Chen, Jun [4 ]
Frey, Benicio N. [5 ]
Milev, Roumen [6 ,7 ]
Muller, Daniel J. [8 ]
Rotzinger, Susan [8 ]
Soares, Claudio N. [5 ]
Uher, Rudolf [9 ]
Kennedy, Sidney H. [8 ]
Lam, Raymond W. [1 ]
机构
[1] Univ British Columbia, Dept Psychiat, Vancouver, BC, Canada
[2] Univ British Columbia, Dept Comp Sci, Vancouver, BC, Canada
[3] Univ Alberta, Dept Psychiat, Edmonton, AB, Canada
[4] Shanghai Mental Hlth Ctr, Shanghai, Peoples R China
[5] McMaster Univ, Dept Psychiat & Behav Neurosci, Hamilton, ON, Canada
[6] Queens Univ, Dept Psychiat, Kingston, ON, Canada
[7] Queens Univ, Dept Psychol, Kingston, ON, Canada
[8] Univ Toronto, Dept Psychiat, Toronto, ON, Canada
[9] Dalhousie Univ, Dept Psychiat, Halifax, NS, Canada
基金
加拿大健康研究院;
关键词
IMPAIRMENT; SCALE; REPRODUCIBILITY; QUESTIONNAIRE; PRODUCTIVITY; VARIABLES; EFFICACY; MODEL; WORK;
D O I
10.1371/journal.pone.0253023
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Objectives Antidepressants are first-line treatments for major depressive disorder (MDD), but 40-60% of patients will not respond, hence, predicting response would be a major clinical advance. Machine learning algorithms hold promise to predict treatment outcomes based on clinical symptoms and episode features. We sought to independently replicate recent machine learning methodology predicting antidepressant outcomes using the Sequenced Treatment Alternatives to Relieve Depression (STAR*D) dataset, and then externally validate these methods to train models using data from the Canadian Biomarker Integration Network in Depression (CAN-BIND-1) dataset. Methods We replicated methodology from Nie et al (2018) using common algorithms based on linear regressions and decision trees to predict treatment-resistant depression (TRD, defined as failing to respond to 2 or more antidepressants) in the STAR*D dataset. We then trained and externally validated models using the clinical features found in both datasets to predict response (>= 50% reduction on the Quick Inventory for Depressive Symptomatology, Self-Rated [QIDS-SR]) and remission (endpoint QIDS-SR score <= 5) in the CAN-BIND-1 dataset. We evaluated additional models to investigate how different outcomes and features may affect prediction performance. Results Our replicated models predicted TRD in the STAR*D dataset with slightly better balanced accuracy than Nie et al (70%-73% versus 64%-71%, respectively). Prediction performance on our external methodology validation on the CAN-BIND-1 dataset varied depending on outcome; performance was worse for response (best balanced accuracy 65%) compared to remission (77%). Using the smaller set of features found in both datasets generally improved prediction performance when evaluated on the STAR*D dataset. Conclusion We successfully replicated prior work predicting antidepressant treatment outcomes using machine learning methods and clinical data. We found similar prediction performance using these methods on an external database, although prediction of remission was better than prediction of response. Future work is needed to improve prediction performance to be clinically useful.
引用
收藏
页数:15
相关论文
共 46 条
[1]   Prognosis and prognostic research: validating a prognostic model [J].
Altman, Douglas G. ;
Vergouwe, Yvonne ;
Royston, Patrick ;
Moons, Karel G. M. .
BMJ-BRITISH MEDICAL JOURNAL, 2009, 338 :1432-1435
[2]  
[Anonymous], 2013, DIAGN STAT MAN MENT
[3]  
[Anonymous], 2018, NAT HUM BEHAV, DOI DOI 10.1038/s41562-018-0399-z
[4]  
[Anonymous], DEPRESSION TREATMENT
[5]  
Batista G.E.A.P.A., 2004, ACM SIGKDD Explor. Newsl, V6, P20, DOI [DOI 10.1145/1007730.1007735, 10.1145/1007730.1007735]
[6]   The efficacy and safety of aripiprazole as adjunctive therapy in major depressive disorder: A multicenter, randomized, double-blind, placebo-controlled study [J].
Berman, Robert M. ;
Marcus, Ronald N. ;
Swanink, Rene ;
McQuade, Robert D. ;
Carson, William H. ;
Corey-Lisle, Patricia K. ;
Khan, Arif .
JOURNAL OF CLINICAL PSYCHIATRY, 2007, 68 (06) :843-853
[7]   Reporting and Methods in Clinical Prediction Research: A Systematic Review [J].
Bouwmeester, Walter ;
Zuithoff, Nicolaas P. A. ;
Mallett, Susan ;
Geerlings, Mirjam I. ;
Vergouwe, Yvonne ;
Steyerberg, Ewout W. ;
Altman, Douglas G. ;
Moons, Karel G. M. .
PLOS MEDICINE, 2012, 9 (05)
[8]  
Brodersen Kay H., 2010, Proceedings of the 2010 20th International Conference on Pattern Recognition (ICPR 2010), P3121, DOI 10.1109/ICPR.2010.764
[9]   Correlated variables in regression: Clustering and sparse estimation [J].
Buehlmann, Peter ;
Ruetimann, Philipp ;
van de Geer, Sara ;
Zhang, Cun-Hui .
JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2013, 143 (11) :1835-1858
[10]   Reevaluating the Efficacy and Predictability of Antidepressant Treatments A Symptom Clustering Approach [J].
Chekroud, Adam M. ;
Gueorguieva, Ralitza ;
Krumholz, Harlan M. ;
Trivedi, Madhukar H. ;
Krystal, John H. ;
McCarthy, Gregory .
JAMA PSYCHIATRY, 2017, 74 (04) :370-378