SMIL-DeiT:Multiple Instance Learning and Self-supervised Vision Transformer network for Early Alzheimer's disease classification

被引:11
作者
Yin, Yue [1 ]
Jin, Weikang [2 ]
Bai, Jing [2 ]
Liu, Ruotong [2 ]
Zhen, Haowei [2 ]
机构
[1] First Affiliated Hosp AFMU, Xian, Peoples R China
[2] Xidian Univ, Xian, Peoples R China
来源
2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2022年
基金
中国国家自然科学基金;
关键词
Early Alzheimer's disease; Self-supervised; Multiple Instance Learning; Vision Transformer;
D O I
10.1109/IJCNN55064.2022.9892524
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Early diagnosis of Alzheimer's disease(AD) is becoming increasingly important in preventing and treating the disease as the world's population ages. We proposed a SMIL-DeiT network for AD classification tasks amongst three groups: Alzheimer's Disease (AD), Mild Cognitive Impairment (MCI), and Normal Cognitive (NC) in this study. Vision Transformer is the fundamental structure of our work. The data pre-training is performed utilizing DINO, a self-supervised technique, whereas the downstream classification task is done with Multiple Instance Learning. Our proposed technique works on the ADNI dataset. We used four performance metrics accuracy rates, precision, recall, and F1-score in the evaluation, the most important of which was accuracy. The accuracy obtained by our method is higher than the transformer's 90.1% and CNN's 90.8%, reaching 93.2%.
引用
收藏
页数:6
相关论文
共 22 条
[1]   Emerging Properties in Self-Supervised Vision Transformers [J].
Caron, Mathilde ;
Touvron, Hugo ;
Misra, Ishan ;
Jegou, Herve ;
Mairal, Julien ;
Bojanowski, Piotr ;
Joulin, Armand .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :9630-9640
[2]  
Cui ZY, 2019, IEEE INT C BIOINFORM, P2324, DOI [10.1109/bibm47256.2019.8983046, 10.1109/BIBM47256.2019.8983046]
[3]  
Dosovitskiy A, 2020, ARXIV
[4]   Meta-analysis based SVM classification enables accurate detection of Alzheimer's disease across different clinical centers using FDG-PET and MRI [J].
Dukart, Juergen ;
Mueller, Karsten ;
Barthel, Henryk ;
Villringer, Arno ;
Sabri, Osama ;
Schroeter, Matthias Leopold .
PSYCHIATRY RESEARCH-NEUROIMAGING, 2013, 212 (03) :230-236
[5]   Speech perception and loanword adaptations: the case of copy-vowel epenthesis [J].
Guevara-Rukoz, Adriana ;
Yu, Shi ;
Peperkamp, Sharon .
INTERSPEECH 2021, 2021, :4004-4008
[6]  
Guo JM, 2019, IEEE INT CONF BIG DA, P5359
[7]   Effects of medication on neuroimaging findings in bipolar disorder: an updated review [J].
Hafeman, Danella M. ;
Chang, Kiki D. ;
Garrett, Amy S. ;
Sanders, Erica M. ;
Phillips, Mary L. .
BIPOLAR DISORDERS, 2012, 14 (04) :375-410
[8]   Momentum Contrast for Unsupervised Visual Representation Learning [J].
He, Kaiming ;
Fan, Haoqi ;
Wu, Yuxin ;
Xie, Saining ;
Girshick, Ross .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :9726-9735
[9]  
He Kaiming, 2021, Masked autoencoders are scalable vision learners
[10]  
Hon M, 2017, IEEE INT C BIOINFORM, P1166, DOI 10.1109/BIBM.2017.8217822