DCCAT: Dual-Coordinate Cross-Attention Transformer for thrombus segmentation on coronary OCT

被引:2
作者
Chu, Miao [1 ,2 ,3 ]
De Maria, Giovanni Luigi [2 ,3 ]
Dai, Ruobing [1 ]
Benenati, Stefano [2 ,3 ,5 ]
Yu, Wei [1 ]
Zhong, Jiaxin [1 ,6 ]
Kotronias, Rafail [2 ,3 ,4 ]
Walsh, Jason [2 ,3 ,4 ]
Andreaggi, Stefano [2 ,7 ]
Zuccarelli, Vittorio [2 ]
Chai, Jason [2 ,3 ]
Channon, Keith [2 ,3 ,4 ]
Banning, Adrian [2 ,3 ,4 ]
Tu, Shengxian [1 ,3 ]
机构
[1] Shanghai Jiao Tong Univ, Biomed Instrument Inst, Sch Biomed Engn, Shanghai, Peoples R China
[2] Oxford Univ Hosp NHS Trust, Oxford Heart Ctr, Oxford, England
[3] Univ Oxford, Radcliffe Dept Med, Div Cardiovasc Med, Oxford, England
[4] Oxford Biomed Res Ctr, Natl Inst Hlth Res, Oxford, England
[5] Univ Genoa, Genoa, Italy
[6] Fujian Med Univ, Union Hosp, Dept Cardiol, Fuzhou, Fujian, Peoples R China
[7] Univ Verona, Dept Med, Div Cardiol, Verona, Italy
基金
中国国家自然科学基金;
关键词
Acute coronary syndromes; Optical coherence tomography; Thrombus segmentation; Cross-attention; OPTICAL COHERENCE TOMOGRAPHY; PLAQUE EROSION; NEURAL-NETWORK; DIAGNOSIS;
D O I
10.1016/j.media.2024.103265
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Acute coronary syndromes (ACS) are one of the leading causes of mortality worldwide, with atherosclerotic plaque rupture and subsequent thrombus formation as the main underlying substrate. Thrombus burden evaluation is important for tailoring treatment therapy and predicting prognosis. Coronary optical coherence tomography (OCT) enables in-vivo visualization of thrombus that cannot otherwise be achieved by other image modalities. However, automatic quantification of thrombus on OCT has not been implemented. The main challenges are due to the variation in location, size and irregularities of thrombus in addition to the small data set. In this paper, we propose a novel dual-coordinate cross-attention transformer network, termed DCCAT, to overcome the above challenges and achieve the first automatic segmentation of thrombus on OCT. Imaging features from both Cartesian and polar coordinates are encoded and fused based on long-range correspondence via multi-head cross-attention mechanism. The dual-coordinate cross-attention block is hierarchically stacked amid convolutional layers at multiple levels, allowing comprehensive feature enhancement. The model was developed based on 5,649 OCT frames from 339 patients and tested using independent external OCT data from 548 frames of 52 patients. DCCAT achieved Dice similarity score (DSC) of 0.706 in segmenting thrombus, which is significantly higher than the CNN-based (0.656) and Transformer-based (0.584) models. We prove that the additional input of polar image not only leverages discriminative features from another coordinate but also improves model robustness for geometrical transformation.Experiment results show that DCCAT achieves competitive performance with only 10% of the total data, highlighting its data efficiency. The proposed dual- coordinate cross-attention design can be easily integrated into other developed Transformer models to boost performance.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Input-output Driven Cross-Attention for Transformer for Quality Prediction of Light Naphtha in Industrial Hydrocracking Processes
    Yang, Ziyi
    Yuan, Xiaofeng
    Wang, Kai
    Chen, Zhiwen
    Wang, Yalin
    Yang, Chunhua
    Gui, Weihua
    IFAC PAPERSONLINE, 2024, 58 (14): : 85 - 90
  • [42] CAT-DTI: cross-attention and Transformer network with domain adaptation for drug-target interaction prediction
    Zeng, Xiaoting
    Chen, Weilin
    Lei, Baiying
    BMC BIOINFORMATICS, 2024, 25 (01)
  • [43] Image-text multimodal classification via cross-attention contextual transformer with modality-collaborative learning
    Shi, Qianyao
    Xu, Wanru
    Miao, Zhenjiang
    JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (04)
  • [44] Two-stream cross-attention vision Transformer based on RGB-D images for pig weight estimation
    He, Wei
    Mi, Yang
    Ding, Xiangdong
    Liu, Gang
    Li, Tao
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2023, 212
  • [45] Sea Ice Semantic Segmentation in Optical Image Based on Adaptive Training Sample Selection and Cross-Attention ResUNet
    Yin, Zhiyong
    Tang, Yuqi
    Bovolo, Francesca
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2025, 22
  • [46] DMDC: a cross-attention network for dynamic mask-based dual-camera snapshot hyperspectral Photography
    Cai, Zeyu
    Zhang, Ziyu
    Jin, Chengqian
    Da, Feipeng
    VISUAL COMPUTER, 2024, : 4957 - 4974
  • [47] Cross-attention time-series multi-feature fusion vision transformer for joint formation monitoring in laser scanning welding
    Yan, Shenghong
    Chen, Bo
    Gao, Han
    Tan, Caiwang
    Song, Xiaoguo
    Wang, Guodong
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2025, 229
  • [48] MFSA-Net: Semantic Segmentation With Camera-LiDAR Cross-Attention Fusion Based on Fast Neighbor Feature Aggregation
    Duan, Yijian
    Meng, Liwen
    Meng, Yanmei
    Zhu, Jihong
    Zhang, Jiacheng
    Zhang, Jinlai
    Liu, Xin
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 19627 - 19639
  • [49] Cross-Attention Transformer-Based Domain Adaptation: A Novel Method for Fault Diagnosis of Rotating Machinery With High Generalizability and Alignment Capability
    Yin, Hua
    Chen, Qitong
    Chen, Liang
    Shen, Changqing
    IEEE SENSORS JOURNAL, 2024, 24 (23) : 40049 - 40058
  • [50] 3D lymphoma segmentation on PET/CT images via multi-scale information fusion with cross-attention
    Huang, Huan
    Qiu, Liheng
    Yang, Shenmiao
    Li, Longxi
    Nan, Jiaofen
    Li, Yanting
    Han, Chuang
    Zhu, Fubao
    Zhao, Chen
    Zhou, Weihua
    MEDICAL PHYSICS, 2025,