Regularized Bagged Canonical Component Analysis for Multiclass Learning in Brain Imaging

被引:0
作者
Carlos Sevilla-Salcedo
Vanessa Gómez-Verdejo
Jussi Tohka
机构
[1] Universidad Carlos III de Madrid,Department of Signal Processing and Communications
[2] University of Eastern Finland,A.I. Virtanen Institute for Molecular Sciences
来源
Neuroinformatics | 2020年 / 18卷
关键词
Canonical correlation analysis; Multiclass classification; Feature selection; Brain imaging;
D O I
暂无
中图分类号
学科分类号
摘要
A fundamental problem of supervised learning algorithms for brain imaging applications is that the number of features far exceeds the number of subjects. In this paper, we propose a combined feature selection and extraction approach for multiclass problems. This method starts with a bagging procedure which calculates the sign consistency of the multivariate analysis (MVA) projection matrix feature-wise to determine the relevance of each feature. This relevance measure provides a parsimonious matrix, which is combined with a hypothesis test to automatically determine the number of selected features. Then, a novel MVA regularized with the sign and magnitude consistency of the features is used to generate a reduced set of summary components providing a compact data description. We evaluated the proposed method with two multiclass brain imaging problems: 1) the classification of the elderly subjects in four classes (cognitively normal, stable mild cognitive impairment (MCI), MCI converting to AD in 3 years, and Alzheimer’s disease) based on structural brain imaging data from the ADNI cohort; 2) the classification of children in 3 classes (typically developing, and 2 types of Attention Deficit/Hyperactivity Disorder (ADHD)) based on functional connectivity. Experimental results confirmed that each brain image (defined by 29.852 features in the ADNI database and 61.425 in the ADHD) could be represented with only 30 − 45% of the original features. Furthermore, this information could be redefined into two or three summary components, providing not only a gain of interpretability but also classification rate improvements when compared to state-of-art reference methods.
引用
收藏
页码:641 / 659
页数:18
相关论文
共 206 条
[1]  
Bellec P(2017)The neuro bureau ADHD-200 preprocessed repository NeuroImage 144 275-286
[2]  
Chu C(2003)Dimensionality reduction via sparse support vector machines Journal of Machine Learning Research 3 1229-1243
[3]  
Chouinard-Decorte F(2001)Random forests Machine Learning 45 5-32
[4]  
Benhajali Y(2015)Standardized evaluation of algorithms for computer-aided diagnosis of dementia based on structural MRI: the CADDementia challenge NeuroImage 111 562-579
[5]  
Margulies DS(2012)Sparse reduced-rank regression for simultaneous dimension reduction and variable selection Journal of the American Statistical Association 107 1533-1545
[6]  
Craddock RC(2017)Alzheimer’s Disease Neuroimaging Initiative Multi-domain transfer learning for early diagnosis of Alzheimer’s disease Neuroinformatics 15 115-132
[7]  
Bi J(2019)Alzheimer’s Disease Neuroimaging Initiative Robust multi-label transfer feature learning for early diagnosis of Alzheimer’s disease Brain Imaging and Behavior 13 138-153
[8]  
Bennett K(2012)A whole brain fMRI atlas generated via spatially constrained spectral clustering Human Brain Mapping 33 1914-1928
[9]  
Embrechts M(2018)Alzheimer’s Disease Neuroimaging Initiative Random forest feature selection, fusion and ensemble strategy: Combining multiple morphological MRI measures to discriminate among healhy elderly, MCI, cMCI and Alzheimer’s disease patients: From the Alzheimer’s disease neuroimaging initiative (ADNI) database Journal of Neuroscience Methods 302 14-23
[10]  
Breneman C(2016)Heterogeneity of neuroanatomical patterns in prodromal Alzheimer’s disease: links to cognition, progression and biomarkers Brain: A Journal of Neurology 140 735-747