Predicting Esophageal Fistula Risks Using a Multimodal Self-attention Network

被引:6
作者
Guan, Yulu [1 ]
Cui, Hui [1 ]
Xu, Yiyue [2 ]
Jin, Qiangguo [3 ]
Feng, Tian [4 ]
Tu, Huawei [1 ]
Xuan, Ping [5 ]
Li, Wanlong [2 ]
Wang, Linlin [2 ]
Duh, Been-Lirn [1 ]
机构
[1] La Trobe Univ, Dept Comp Sci & Informat Technol, Melbourne, Vic, Australia
[2] Shandong First Med Univ & Shandong Acad Med Sci, Shandong Canc Hosp & Inst, Dept Radiat Oncol, Jinan, Peoples R China
[3] Tianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China
[4] Zhejiang Univ, Sch Software Technol, Hangzhou, Peoples R China
[5] Heilongjiang Univ, Dept Comp Sci & Technol, Harbin, Peoples R China
来源
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT V | 2021年 / 12905卷
关键词
Esophageal fistula prediction; Self attention; Multimodal attention;
D O I
10.1007/978-3-030-87240-3_69
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Radiotherapy plays a vital role in treating patients with esophageal cancer (EC), whereas potential complications such as esophageal fistula (EF) can be devastating and even life-threatening. Therefore, predicting EF risks prior to radiotherapies for EC patients is crucial for their clinical treatment and quality of life. We propose a novel method of combining thoracic Computerized Tomography (CT) scans and clinical tabular data to improve the prediction of EF risks in EC patients. The multimodal network includes encoders to extract salient features from images and clinical data, respectively. In addition, we devise a self-attention module, named VisText, to uncover the complex relationships and correlations among different features. The associated multimodal features are integrated with clinical features by aggregation to further enhance prediction accuracy. Experimental results indicate that our method classifies EF status for EC patients with an accuracy of 0.8366, F1 score of 0.7337, specificity of 0.9312 and AUC of 0.9119, outperforming other methods in comparison.
引用
收藏
页码:721 / 730
页数:10
相关论文
共 22 条
  • [1] Surgical treatment of esophageal cancer in the era of multimodality management
    Borggreve, Alicia S.
    Kingma, B. Feike
    Domrachev, Serg A.
    Koshkin, Mikhail A.
    Ruurda, Jelle P.
    van Hillegersberg, Richard
    Takeda, Flavio R.
    Goense, Lucas
    [J]. ANNALS OF THE NEW YORK ACADEMY OF SCIENCES, 2018, 1434 (01) : 192 - 209
  • [2] Chauhan Geeticka, 2020, Med Image Comput Comput Assist Interv, V12262, P529, DOI 10.1007/978-3-030-59713-9_51
  • [3] The current status of multimodality treatment for unresectable locally advanced esophageal squamous cell carcinoma
    Hirano, Hidekazu
    Boku, Narikazu
    [J]. ASIA-PACIFIC JOURNAL OF CLINICAL ONCOLOGY, 2018, 14 (04) : 291 - 299
  • [4] Hui Cui, 2020, Medical Image Computing and Computer Assisted Intervention - MICCAI 2020. 23rd International Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12261), P212, DOI 10.1007/978-3-030-59710-8_21
  • [5] Free-form tumor synthesis in computed tomography images via richer generative adversarial network
    Jin, Qiangguo
    Cui, Hui
    Sun, Changming
    Meng, Zhaopeng
    Su, Ran
    [J]. KNOWLEDGE-BASED SYSTEMS, 2021, 218
  • [6] RA-UNet: A Hybrid Deep Attention-Aware Network to Extract Liver and Tumor in CT Scans
    Jin, Qiangguo
    Meng, Zhaopeng
    Sun, Changming
    Cui, Hui
    Su, Ran
    [J]. FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY, 2020, 8
  • [7] Referring Image Segmentation via Recurrent Refinement Networks
    Li, Ruiyu
    Li, Kaican
    Kuo, Yi-Chun
    Shu, Michelle
    Qi, Xiaojuan
    Shen, Xiaoyong
    Jia, Jiaya
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 5745 - 5753
  • [8] Attention gated networks: Learning to leverage salient regions in medical images
    Schlemper, Jo
    Oktay, Ozan
    Schaap, Michiel
    Heinrich, Mattias
    Kainz, Bernhard
    Glocker, Ben
    Rueckert, Daniel
    [J]. MEDICAL IMAGE ANALYSIS, 2019, 53 : 197 - 207
  • [9] DeepInsight: A methodology to transform a non-image data to an image for convolution neural network architecture
    Sharma, Alok
    Vans, Edwin
    Shigemizu, Daichi
    Boroevich, Keith A.
    Tsunoda, Tatsuhiko
    [J]. SCIENTIFIC REPORTS, 2019, 9 (1)
  • [10] Key-Word-Aware Network for Referring Expression Image Segmentation
    Shi, Hengcan
    Li, Hongliang
    Meng, Fanman
    Wu, Qingbo
    [J]. COMPUTER VISION - ECCV 2018, PT VI, 2018, 11210 : 38 - 54