SVM recursive feature elimination analyses of structural brain MRI predicts near-term relapses in patients with clinically isolated syndromes suggestive of multiple sclerosis

被引:45
作者
Wottschel, Viktor [1 ,2 ]
Chard, Declan T. [2 ,3 ]
Enzinger, Christian [4 ]
Filippi, Massimo [5 ]
Frederiksen, Jette L. [6 ,7 ]
Gasperini, Claudio [8 ]
Giorgio, Antonio [9 ]
Rocca, Maria A. [5 ]
Rovira, Alex [10 ]
De Stefano, Nicola [9 ]
Tintore, Mar [10 ]
Alexander, Daniel C. [11 ]
Barkhof, Frederik [1 ,2 ,3 ,12 ]
Ciccarelli, Olga [2 ,3 ]
机构
[1] Univ Amsterdam, Med Ctr, VUmc, Dept Radiol & Nucl Med, Postbus 7057, NL-1007 MB Amsterdam, Netherlands
[2] UCL, Queen Sq MS Ctr, London, England
[3] Univ Coll London Hosp, Natl Inst Hlth Res, Biomed Res Ctr, London, England
[4] Med Univ Graz, Dept Neurol, Res Unit Neuronal Repair & Plast, Graz, Austria
[5] Univ Vita Salute San Raffaele, San Raffaele Sci Inst, Div Neurosci, Neuroimaging Res Unit,Inst Expt Neurol, Milan, Italy
[6] Rigshosp, Glostrup, Denmark
[7] Univ Copenhagen, Copenhagen, Denmark
[8] San Camillo Forlanini Hosp, Rome, Italy
[9] Univ Siena, Siena, Italy
[10] Hosp Valle De Hebron, Barcelona, Spain
[11] UCL, Dept Comp Sci, Ctr Med Image Comp, London, England
[12] UCL, Inst Neurol & Healthcare Engn, London, England
关键词
Multiple sclerosis; Machine learning classification; Feature selection; LESION LOAD; ATROPHY; MATTER; DISABILITY; CLASSIFICATION;
D O I
10.1016/j.nicl.2019.102011
中图分类号
R445 [影像诊断学];
学科分类号
100207 ;
摘要
Machine learning classification is an attractive approach to automatically differentiate patients from healthy subjects, and to predict future disease outcomes. A clinically isolated syndrome (CIS) is often the first pre-sentation of multiple sclerosis (MS), but it is difficult at onset to predict who will have a second relapse and hence convert to clinically definite MS. In this study, we thus aimed to distinguish CIS converters from non-converters at onset of a CIS, using recursive feature elimination and weight averaging with support vector machines. We also sought to assess the influence of cohort size and cross-validation methods on the accuracy estimate of the classification. We retrospectively collected 400 patients with CIS from six European MAGNIMS MS centres. Patients un-derwent brain MRI at onset of a CIS according to local standard-of-care protocols. The diagnosis of clinically definite MS at one-year follow-up was the standard against which the accuracy of the model was tested. For each patient, we derived MRI-based features, such as grey matter probability, white matter lesion load, cortical thickness, and volume of specific cortical and white matter regions. Features with little contribution to the classification model were removed iteratively through an interleaved sample bootstrapping and feature aver-aging approach. Classification of CIS outcome at one-year follow-up was performed with 2-fold, 5-fold, 10-fold and leave-one-out cross-validation for each centre cohort independently and in all patients together. The estimated classification accuracy across centres ranged from 64.9% to 88.1% using 2-fold cross-validation and from 73% to 92.9% using leave-one-out cross-validation. The classification accuracy estimate was higher in single-centre, smaller data sets than in combinations of data sets, being the lowest when all patients were merged together. Regional MRI features such as WM lesions, grey matter probability in the thalamus and the precuneus or cortical thickness in the cuneus and inferior temporal gyrus predicted the occurrence of a second relapse in patients at onset of a CIS using support vector machines. The increased accuracy estimate of the classification achieved with smaller and single-centre samples may indicate a model bias (overfitting) when data points were limited, but also more homogeneous. We provide an overview of classifier performance from a range of cross-validation schemes to give insight into the variability across schemes. The proposed recursive feature elimination approach with weight averaging can be used both in single- and multi-centre data sets in order to bridge the gap between group-level comparisons and making predictions for individual patients.
引用
收藏
页数:8
相关论文
共 32 条
  • [1] An approach for classification of highly imbalanced data using weighting and undersampling
    Anand, Ashish
    Pugalenthi, Ganesan
    Fogel, Gary B.
    Suganthan, P. N.
    [J]. AMINO ACIDS, 2010, 39 (05) : 1385 - 1391
  • [2] A survey of cross-validation procedures for model selection
    Arlot, Sylvain
    Celisse, Alain
    [J]. STATISTICS SURVEYS, 2010, 4 : 40 - 79
  • [3] Atrophy mainly affects the limbic system and the deep grey matter at the first stage of multiple sclerosis
    Audoin, Bertrand
    Zaaraoui, Wafaa
    Reuter, Francoise
    Rico, Audrey
    Malikova, Irina
    Confort-Gouny, Sylviane
    Cozzone, Patrick J.
    Pelletier, Jean
    Ranjeva, Jean-Philippe
    [J]. JOURNAL OF NEUROLOGY NEUROSURGERY AND PSYCHIATRY, 2010, 81 (06) : 690 - 695
  • [4] Basal ganglia, thalamus and neocortical atrophy predicting slowed cognitive processing in multiple sclerosis
    Batista, Sonia
    Zivadinov, Robert
    Hoogs, Marietta
    Bergsland, Niels
    Heininen-Brown, Mari
    Dwyer, Michael G.
    Weinstock-Guttman, Bianca
    Benedict, Ralph H. B.
    [J]. JOURNAL OF NEUROLOGY, 2012, 259 (01) : 139 - 146
  • [5] MRI-based prediction of conversion from clinically isolated syndrome to clinically definite multiple sclerosis using SVM and lesion geometry
    Bendfeldt, Kerstin
    Taschler, Bernd
    Gaetano, Laura
    Madoerin, Philip
    Kuster, Pascal
    Mueller-Lenke, Nicole
    Amann, Michael
    Vrenken, Hugo
    Wottschel, Viktor
    Barkhof, Frederik
    Borgwardt, Stefan
    Kloeppel, Stefan
    Wicklein, Eva-Maria
    Kappos, Ludwig
    Edan, Gilles
    Freedman, Mark S.
    Montalban, Xavier
    Hartung, Hans-Peter
    Pohl, Christoph
    Sandbrink, Rupert
    Sprenger, Till
    Radue, Ernst-Wilhelm
    Wuerfel, Jens
    Nichols, Thomas E.
    [J]. BRAIN IMAGING AND BEHAVIOR, 2019, 13 (05) : 1361 - 1374
  • [6] Multivariate pattern classification of gray matter pathology in multiple sclerosis
    Bendfeldt, Kerstin
    Kloeppel, Stefan
    Nichols, Thomas E.
    Smieskova, Renata
    Kuster, Pascal
    Traud, Stefan
    Mueller-Lenke, Nicole
    Naegelin, Yvonne
    Kappos, Ludwig
    Radue, Ernst-Wilhelm
    Borgwardt, Stefan J.
    [J]. NEUROIMAGE, 2012, 60 (01) : 400 - 408
  • [7] A review of microarray datasets and applied feature selection methods
    Bolon-Canedo, V.
    Sanchez-Marono, N.
    Alonso-Betanzos, A.
    Benitez, J. M.
    Herrera, F.
    [J]. INFORMATION SCIENCES, 2014, 282 : 111 - 135
  • [8] The predictive value of gray matter atrophy in clinically isolated syndromes
    Calabrese, M.
    Rinaldi, F.
    Mattisi, I.
    Bernardi, V.
    Favaretto, A.
    Perini, P.
    Gallo, P.
    [J]. NEUROLOGY, 2011, 77 (03) : 257 - 263
  • [9] Cardoso M, 2012, IMAGE COMPUT, DOI DOI 10.1007/978-3-642-33418-4_33
  • [10] Registration based cortical thickness measurement
    Das, Sandhitsu R.
    Avants, Brian B.
    Grossman, Murray
    Gee, James C.
    [J]. NEUROIMAGE, 2009, 45 (03) : 867 - 879