Evaluating the performance of machine learning models for automatic diagnosis of patients with schizophrenia based on a single site dataset of 440 participants

Cited by: 12
Authors
Lee, Lung-Hao [1, 2, 3]
Chen, Chang-Hao [1, 3]
Chang, Wan-Chen [4, 5, 6]
Lee, Po-Lei [1, 3]
Shyu, Kuo-Kai [1, 3]
Chen, Mu-Hong [6, 7]
Hsu, Ju-Wei [6, 7]
Bai, Ya-Mei [6, 7, 8]
Su, Tung-Ping [7, 8, 9]
Tu, Pei-Chi [5, 6, 7, 10]
Affiliations
[1] Natl Cent Univ, Dept Elect Engn, Taoyuan, Taiwan
[2] Kaohsiung Med Univ, Coll Med, Dept Med Humanities & Educ, Kaohsiung, Taiwan
[3] Pervas Artificial Intelligence Res PAIR Labs, Hsinchu, Taiwan
[4] Natl Yang Ming Chiao Tung Univ, Dept Biomed Engn, Taipei, Taiwan
[5] Taipei Vet Gen Hosp, Dept Med Res, Taipei, Taiwan
[6] Taipei Vet Gen Hosp, Dept Psychiat, Taipei, Taiwan
[7] Natl Yang Ming Chiao Tung Univ, Fac Med, Dept Psychiat, Taipei, Taiwan
[8] Natl Yang Ming Chiao Tung Univ, Inst Brain Sci, Taipei, Taiwan
[9] Cheng Hsin Gen Hosp, Dept Psychiat, Taipei, Taiwan
[10] Natl Yang Ming Chiao Tung Univ, Inst Philosophy Mind & Cognit, Taipei, Taiwan
Keywords
Automatic classification; functional connectivity; homogeneous; schizophrenic disorder; support vector machine; training sample size; FUNCTIONAL CONNECTIVITY; NETWORK; BRAIN; FMRI; PARCELLATION; DYSCONNECTIVITY; CORTEX
DOI
10.1192/j.eurpsy.2021.2248
Chinese Library Classification number
R749 [Psychiatry]
Discipline classification code
100205
Abstract
Background: Support vector machines (SVMs) based on brain-wise functional connectivity (FC) have been widely adopted for single-subject prediction of patients with schizophrenia, but most such studies had small sample sizes. This study aimed to evaluate the performance of SVMs based on a large single-site dataset and to investigate the effects of demographic homogeneity and training sample size on classification accuracy.
Methods: The resting-state functional magnetic resonance imaging (fMRI) dataset comprised 220 patients with schizophrenia and 220 healthy controls. Brain-wise FCs were calculated for each participant, and linear SVMs were developed for automatic classification of patients and controls. First, we evaluated the SVMs based on all participants and on homogeneous subsamples of men, women, younger (18-30 years), and older (31-50 years) participants by 10-fold nested cross-validation. Then, we held out a fixed test set of 40 participants (20 patients and 20 controls) and evaluated the SVMs based on incremental training sample sizes (N = 40, 80, ..., 400).
Results: The SVMs based on all participants had an accuracy of 85.05%. The SVMs based on male, female, younger, and older participants yielded accuracies of 84.66, 81.56, 80.50, and 86.13%, respectively. Although the SVMs based on the older subsample performed better than those based on all participants, they generalized poorly to younger participants (77.24%). For incremental training sizes, the classification accuracy increased stepwise from 72.6 to 83.3%, with >80% accuracy achieved with sample size >240.
Conclusions: The findings indicate that SVMs based on a large dataset yield high classification accuracy, and that models established using a large sample with heterogeneous properties are recommended for single-subject prediction of schizophrenia.
Pages: 10