Multi-Scale Spatial-Temporal Attention Networks for Functional Connectome Classification

被引:1
作者
Kong, Youyong [1 ,2 ]
Zhang, Xiaotong [3 ]
Wang, Wenhan [1 ,2 ]
Zhou, Yue [4 ]
Li, Yueying [1 ,2 ]
Yuan, Yonggui [4 ]
机构
[1] Southeast Univ, Sch Comp Sci & Engn, Prov Joint Int Res Lab Med Informat Proc, Nanjing 210096, Peoples R China
[2] Southeast Univ, Key Lab New Generat Artificial Intelligence Techno, Minist Educ, Nanjing 210096, Peoples R China
[3] Southeast Univ, Sch Software Engn, Nanjing 210096, Peoples R China
[4] Southeast Univ, Zhongda Hosp, Sch Med, Dept Psychosomat & Psychiat, Nanjing 210009, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Topology; Transformers; Attention mechanisms; Network topology; Representation learning; Functional magnetic resonance imaging; Graph neural networks; spatial-temporal attention; transformer; brain disorder diagnosis; functional connectivity; MAJOR DEPRESSIVE DISORDER; CONNECTIVITY;
D O I
10.1109/TMI.2024.3448214
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Many neuropsychiatric disorders are considered to be associated with abnormalities in the functional connectivity networks of the brain. The research on the classification of functional connectivity can therefore provide new perspectives for understanding the pathology of disorders and contribute to early diagnosis and treatment. Functional connectivity exhibits a nature of dynamically changing over time, however, the majority of existing methods are unable to collectively reveal the spatial topology and time-varying characteristics. Furthermore, despite the efforts of limited spatial-temporal studies to capture rich information across different spatial scales, they have not delved into the temporal characteristics among different scales. To address above issues, we propose a novel Multi-Scale Spatial-Temporal Attention Networks (MSSTAN) to exploit the multi-scale spatial-temporal information provided by functional connectome for classification. To fully extract spatial features of brain regions, we propose a Topology Enhanced Graph Transformer module to guide the attention calculations in the learning of spatial features by incorporating topology priors. A Multi-Scale Pooling Strategy is introduced to obtain representations of brain connectome at various scales. Considering the temporal dynamic characteristics between dynamic functional connectome, we employ Locality Sensitive Hashing attention to further capture long-term dependencies in time dynamics across multiple scales and reduce the computational complexity of the original attention mechanism. Experiments on three brain fMRI datasets of MDD and ASD demonstrate the superiority of our proposed approach. In addition, benefiting from the attention mechanism in Transformer, our results are interpretable, which can contribute to the discovery of biomarkers. The code is available at https://github.com/LIST-KONG/MSSTAN.
引用
收藏
页码:475 / 488
页数:14
相关论文
共 50 条
[31]   Attention based adaptive spatial-temporal hypergraph convolutional networks for stock trend [J].
Su, Hongyang ;
Wang, Xiaolong ;
Qin, Yang ;
Chen, Qingcai .
EXPERT SYSTEMS WITH APPLICATIONS, 2024, 238
[32]   Target-Aware Tracking With Spatial-Temporal Context Attention [J].
He, Kai-Jie ;
Zhang, Can-Long ;
Xie, Sheng ;
Li, Zhi-Xin ;
Wang, Zhi-Wen ;
Qin, Rui-Guo .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (08) :7176-7189
[33]   Joint spatial-temporal attention for action recognition [J].
Yu, Tingzhao ;
Guo, Chaoxu ;
Wang, Lingfeng ;
Gu, Huxiang ;
Xiang, Shiming ;
Pan, Chunhong .
PATTERN RECOGNITION LETTERS, 2018, 112 :226-233
[34]   A Multitemporal Scale and Spatial-Temporal Transformer Network for Temporal Action Localization [J].
Gao, Zan ;
Cui, Xinglei ;
Zhuo, Tao ;
Cheng, Zhiyong ;
Liu, An-An ;
Wang, Meng ;
Chen, Shenyong .
IEEE TRANSACTIONS ON HUMAN-MACHINE SYSTEMS, 2023, 53 (03) :569-580
[35]   A Multi-scale Spatial and Temporal Attention Network on Dynamic Connectivity to Localize the Eloquent Cortex in Brain Tumor Patients [J].
Nandakumar, Naresh ;
Manzoor, Komal ;
Agarwal, Shruti ;
Pillai, Jay J. ;
Gujar, Sachin K. ;
Sair, Haris I. ;
Venkataraman, Archana .
INFORMATION PROCESSING IN MEDICAL IMAGING, IPMI 2021, 2021, 12729 :241-252
[36]   Multi-Scale Contrastive Attention Representation Learning for Encrypted Traffic Classification [J].
Yang, Shuo ;
Zheng, Xinran ;
Li, Jinze ;
Xu, Jinfeng ;
Ngai, Edith C. H. .
PROCEEDINGS OF THE 33RD ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2024, 2024, :4173-4177
[37]   MSSA-DIS: Multi-Scale Spatial Attention with Discriminative Instance Selection for Whole Slide Image Classification [J].
Lin, Yi ;
Li, Yunjiao ;
Long, Xiongbai ;
Ye, Yanyan ;
Guo, Jing ;
Li, Depei .
ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2025,
[38]   Spatial-Temporal Graph Boosting Networks: Enhancing Spatial-Temporal Graph Neural Networks via Gradient Boosting [J].
Fan, Yujie ;
Yeh, Chin-Chia Michael ;
Chen, Huiyuan ;
Zheng, Yan ;
Wang, Liang ;
Wang, Junpeng ;
Dai, Xin ;
Zhuang, Zhongfang ;
Zhang, Wei .
PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, :504-513
[39]   Spatial-Temporal Fusion Graph Neural Networks With Mixed Adjacency for Weather Forecasting [J].
Guo, Ang ;
Liu, Yanghe ;
Shao, Shiyu ;
Shi, Xiaowei ;
Feng, Zhenni .
IEEE ACCESS, 2025, 13 :15812-15824
[40]   TRANSTL: SPATIAL-TEMPORAL LOCALIZATION TRANSFORMER FOR MULTI-LABEL VIDEO CLASSIFICATION [J].
Wu, Hongjun ;
Li, Mengzhu ;
Liu, Yongcheng ;
Liu, Hongzhe ;
Xu, Cheng ;
Li, Xuewei .
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, :1965-1969