DCCAT: Dual-Coordinate Cross-Attention Transformer for thrombus segmentation on coronary OCT

被引:2
|
作者
Chu, Miao [1 ,2 ,3 ]
De Maria, Giovanni Luigi [2 ,3 ]
Dai, Ruobing [1 ]
Benenati, Stefano [2 ,3 ,5 ]
Yu, Wei [1 ]
Zhong, Jiaxin [1 ,6 ]
Kotronias, Rafail [2 ,3 ,4 ]
Walsh, Jason [2 ,3 ,4 ]
Andreaggi, Stefano [2 ,7 ]
Zuccarelli, Vittorio [2 ]
Chai, Jason [2 ,3 ]
Channon, Keith [2 ,3 ,4 ]
Banning, Adrian [2 ,3 ,4 ]
Tu, Shengxian [1 ,3 ]
机构
[1] Shanghai Jiao Tong Univ, Biomed Instrument Inst, Sch Biomed Engn, Shanghai, Peoples R China
[2] Oxford Univ Hosp NHS Trust, Oxford Heart Ctr, Oxford, England
[3] Univ Oxford, Radcliffe Dept Med, Div Cardiovasc Med, Oxford, England
[4] Oxford Biomed Res Ctr, Natl Inst Hlth Res, Oxford, England
[5] Univ Genoa, Genoa, Italy
[6] Fujian Med Univ, Union Hosp, Dept Cardiol, Fuzhou, Fujian, Peoples R China
[7] Univ Verona, Dept Med, Div Cardiol, Verona, Italy
基金
中国国家自然科学基金;
关键词
Acute coronary syndromes; Optical coherence tomography; Thrombus segmentation; Cross-attention; OPTICAL COHERENCE TOMOGRAPHY; PLAQUE EROSION; NEURAL-NETWORK; DIAGNOSIS;
D O I
10.1016/j.media.2024.103265
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Acute coronary syndromes (ACS) are one of the leading causes of mortality worldwide, with atherosclerotic plaque rupture and subsequent thrombus formation as the main underlying substrate. Thrombus burden evaluation is important for tailoring treatment therapy and predicting prognosis. Coronary optical coherence tomography (OCT) enables in-vivo visualization of thrombus that cannot otherwise be achieved by other image modalities. However, automatic quantification of thrombus on OCT has not been implemented. The main challenges are due to the variation in location, size and irregularities of thrombus in addition to the small data set. In this paper, we propose a novel dual-coordinate cross-attention transformer network, termed DCCAT, to overcome the above challenges and achieve the first automatic segmentation of thrombus on OCT. Imaging features from both Cartesian and polar coordinates are encoded and fused based on long-range correspondence via multi-head cross-attention mechanism. The dual-coordinate cross-attention block is hierarchically stacked amid convolutional layers at multiple levels, allowing comprehensive feature enhancement. The model was developed based on 5,649 OCT frames from 339 patients and tested using independent external OCT data from 548 frames of 52 patients. DCCAT achieved Dice similarity score (DSC) of 0.706 in segmenting thrombus, which is significantly higher than the CNN-based (0.656) and Transformer-based (0.584) models. We prove that the additional input of polar image not only leverages discriminative features from another coordinate but also improves model robustness for geometrical transformation.Experiment results show that DCCAT achieves competitive performance with only 10% of the total data, highlighting its data efficiency. The proposed dual- coordinate cross-attention design can be easily integrated into other developed Transformer models to boost performance.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Twins transformer: Cross-attention based two-branch transformer network for rotating bearing fault diagnosis
    Li, Jie
    Bao, Yu
    Liu, Wenxin
    Ji, Pengxiang
    Wang, Lekang
    Wang, Zhongbing
    MEASUREMENT, 2023, 223
  • [22] GazeSymCAT: A symmetric cross-attention transformer for robust gaze estimation under extreme head poses and gaze variations
    Zhong, Yupeng
    Lee, Sang Hun
    JOURNAL OF COMPUTATIONAL DESIGN AND ENGINEERING, 2025, 12 (03) : 115 - 129
  • [23] Multimodal Dual Cross-Attention Fusion Strategy for Autonomous Garbage Classification System
    Xu, Huxiu
    Tang, Wei
    Li, Zhaoyang
    Qin, Kecheng
    Zou, Jun
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (11) : 13319 - 13329
  • [24] CrossU-Net: Dual-modality cross-attention U-Net for segmentation of precancerous lesions in gastric cancer
    Wang, Jiansheng
    Zhang, Benyan
    Wang, Yan
    Zhou, Chunhua
    Vonsky, Maxim S.
    Mitrofanova, Lubov B.
    Zou, Duowu
    Li, Qingli
    COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2024, 112
  • [25] Few Shot Medical Image Segmentation with Cross Attention Transformer
    Lin, Yi
    Chen, Yufan
    Cheng, Kwang-Ting
    Chen, Hao
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT II, 2023, 14221 : 233 - 243
  • [26] CAF-ViT: A cross-attention based Transformer network for underwater acoustic target recognition
    Dong, Wenfeng
    Fu, Jin
    Zou, Nan
    Zhao, Chunpeng
    Miao, Yixin
    Shen, Zheng
    OCEAN ENGINEERING, 2025, 318
  • [27] Interactive CNN and Transformer-Based Cross-Attention Fusion Network for Medical Image Classification
    Cai, Shu
    Zhang, Qiude
    Wang, Shanshan
    Hu, Junjie
    Zeng, Liang
    Li, Kaiyan
    INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2025, 35 (03)
  • [28] Intersection-union dual-stream cross-attention Lova-SwinUnet for skin cancer hair segmentation and image repair
    Qin, Juanjuan
    Pei, Dong
    Guo, Qian
    Cai, Xingjuan
    Xie, Liping
    Zhang, Wensheng
    Computers in Biology and Medicine, 2024, 180
  • [29] Cross-attention based dual-similarity network for few-shot learning
    Sim, Chan
    Kim, Gyeonghwan
    PATTERN RECOGNITION LETTERS, 2024, 186 : 1 - 6
  • [30] DACFusion: Dual Asymmetric Cross-Attention guided feature fusion for multispectral object detection
    Qian, Jingchen
    Qiao, Baiyou
    Zhang, Yuekai
    Liu, Tongyan
    Wang, Shuo
    Wu, Gang
    Han, Donghong
    NEUROCOMPUTING, 2025, 635