Multimodal transformer network for incomplete image generation and diagnosis of Alzheimer's disease

被引:18
作者
Gao, Xingyu [1 ]
Shi, Feng [2 ]
Shen, Dinggang [2 ,3 ]
Liu, Manhua [1 ,4 ]
机构
[1] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai, Peoples R China
[2] Shanghai United Imaging Intelligence Co Ltd, Dept Res & Dev, Shanghai, Peoples R China
[3] ShanghaiTech Univ, Sch Biomed Engn, Shanghai, Peoples R China
[4] Shanghai Jiao Tong Univ, AI Inst, MoE Key Lab Artificial Intelligence, Shanghai, Peoples R China
基金
上海市自然科学基金; 中国国家自然科学基金;
关键词
Multimodal brain images; Generative adversarial network; Transformer; Image generation; Disease diagnosis; ESTIMATING CT IMAGE; CLASSIFICATION; REPRESENTATION; ROBUST; GAN;
D O I
10.1016/j.compmedimag.2023.102303
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Multimodal images such as magnetic resonance imaging (MRI) and positron emission tomography (PET) could provide complementary information about the brain and have been widely investigated for the diagnosis of neurodegenerative disorders such as Alzheimer's disease (AD). However, multimodal brain images are often incomplete in clinical practice. It is still challenging to make use of multimodality for disease diagnosis with missing data. In this paper, we propose a deep learning framework with the multi-level guided generative adversarial network (MLG-GAN) and multimodal transformer (Mul-T) for incomplete image generation and disease classification, respectively. First, MLG-GAN is proposed to generate the missing data, guided by multi-level information from voxels, features, and tasks. In addition to voxel-level supervision and task-level constraint, a feature-level auto-regression branch is proposed to embed the features of target images for an accurate generation. With the complete multimodal images, we propose a Mul-T network for disease diagnosis, which can not only combine the global and local features but also model the latent interactions and correlations from one modality to another with the cross-modal attention mechanism. Comprehensive experiments on three independent datasets (i.e., ADNI-1, ADNI-2, and OASIS-3) show that the proposed method achieves superior performance in the tasks of image generation and disease diagnosis compared to state-of-the-art methods.
引用
收藏
页数:11
相关论文
共 42 条
[31]   Multi-modal classification of Alzheimer's disease using nonlinear graph fusion [J].
Tong, Tong ;
Gray, Katherine ;
Gao, Qinquan ;
Chen, Liang ;
Rueckert, Daniel .
PATTERN RECOGNITION, 2017, 63 :171-181
[32]   Estimating CT Image From MRI Data Using Structured Random Forest and Auto-Context Model [J].
Tri Huynh ;
Gao, Yaozong ;
Kang, Jiayin ;
Wang, Li ;
Zhang, Pei ;
Lian, Jun ;
Shen, Dinggang .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2016, 35 (01) :174-183
[33]  
Tsai YHH, 2019, 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), P6558, DOI 10.18653/v1/p19-1656
[34]   Multi-modal imaging predicts memory performance in normal aging and cognitive decline [J].
Walhovd, K. B. ;
Fjell, A. M. ;
Dale, A. M. ;
McEvoy, L. K. ;
Brewer, J. ;
Karow, D. S. ;
Salmon, D. P. ;
Fennema-Notestine, C. .
NEUROBIOLOGY OF AGING, 2010, 31 (07) :1107-1121
[35]   Image quality assessment: From error visibility to structural similarity [J].
Wang, Z ;
Bovik, AC ;
Sheikh, HR ;
Simoncelli, EP .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2004, 13 (04) :600-612
[36]   Bi-level multi-source learning for heterogeneous block-wise missing data [J].
Xiang, Shuo ;
Yuan, Lei ;
Fan, Wei ;
Wang, Yalin ;
Thompson, Paul M. ;
Ye, Jieping .
NEUROIMAGE, 2014, 102 :192-206
[37]   Unsupervised MR-to-CT Synthesis Using Structure-Constrained CycleGAN [J].
Yang, Heran ;
Sun, Jian ;
Carass, Aaron ;
Zhao, Can ;
Lee, Junghoon ;
Prince, Jerry L. ;
Xu, Zongben .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2020, 39 (12) :4249-4261
[38]   Multimodal classification of Alzheimer's disease and mild cognitive impairment [J].
Zhang, Daoqiang ;
Wang, Yaping ;
Zhou, Luping ;
Yuan, Hong ;
Shen, Dinggang .
NEUROIMAGE, 2011, 55 (03) :856-867
[39]   Multi-modal latent space inducing ensemble SVM classifier for early dementia diagnosis with neuroimaging data [J].
Zhou, Tao ;
Thung, Kim-Han ;
Liu, Mingxia ;
Shi, Feng ;
Zhang, Changqing ;
Shen, Dinggang .
MEDICAL IMAGE ANALYSIS, 2020, 60
[40]   Latent Representation Learning for Alzheimer's Disease Diagnosis With Incomplete Multi-Modality Neuroimaging and Genetic Data [J].
Zhou, Tao ;
Liu, Mingxia ;
Thung, Kim-Han ;
Shen, Dinggang .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2019, 38 (10) :2411-2422