Structure-Guided Cross-Attention Network for Cross-Domain OCT Fluid Segmentation

Cited by: 4
Authors
He, Xingxin [1]
Zhong, Zhun [2]
Fang, Leyuan [1]
He, Min [1,3]
Sebe, Nicu [2]
Affiliations
[1] Hunan Univ, Coll Elect & Informat Engn, Changsha 410082, Peoples R China
[2] Univ Trento, Dept Informat Engn & Comp Sci DISI, I-38122 Trento, Italy
[3] Zhejiang Canc Hosp, Key Lab Head & Neck Canc Translat Res Zhejiang Pr, Hangzhou 310022, Zhejiang, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Optical coherence tomography; retinal fluid segmentation; cross-domain segmentation; retinal structure; MACULAR EDEMA; AUTOMATED SEGMENTATION; SEMANTIC SEGMENTATION; RETINAL LAYER; ADAPTATION; FRAMEWORK;
DOI
10.1109/TIP.2022.3228163
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Accurate retinal fluid segmentation on Optical Coherence Tomography (OCT) images plays an important role in diagnosing and treating various eye diseases. State-of-the-art deep models have shown promising performance on OCT image segmentation when given pixel-wise annotated training data. However, a learned model performs poorly on OCT images obtained from different devices (domains) because of domain shift. This problem largely limits the real-world application of OCT image segmentation, since device types usually differ from hospital to hospital. In this paper, we study the task of cross-domain OCT fluid segmentation, where we are given a labeled dataset from the source device (domain) and an unlabeled dataset from the target device (domain). The goal is to learn a model that performs well on the target domain. To solve this problem, we propose a novel Structure-guided Cross-Attention Network (SCAN), which leverages the retinal layer structure to facilitate domain alignment. SCAN is inspired by the fact that the retinal layer structure is robust across domains and can reflect regions that are important to fluid segmentation. In light of this, we build SCAN in a multi-task manner by jointly learning retinal structure prediction and fluid segmentation. To exploit the mutual benefit between layer structure and fluid segmentation, we further introduce a cross-attention module that measures the correlation between the layer-specific feature and the fluid-specific feature, encouraging the model to concentrate on highly relevant regions during domain alignment. Moreover, an adaptation difficulty map is computed from the retinal structure predictions of the different domains, which forces the model to focus on hard regions during structure-aware adversarial learning. Extensive experiments on the three domains of the RETOUCH dataset demonstrate the effectiveness of the proposed method and show that our approach achieves state-of-the-art performance on cross-domain OCT fluid segmentation.
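As a rough illustration of the cross-attention idea described in the abstract, the sketch below lets fluid-specific features attend to layer-specific features using plain scaled dot-product attention. This is not the SCAN implementation: the abstract does not give the module's exact form, so the function names (softmax, cross_attention), the choice of fluid features as queries, the scaling factor, and the residual connection are all assumptions made for illustration only.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attention(fluid_feats, layer_feats):
    """Hypothetical cross-attention between two feature sets.

    fluid_feats: list of N feature vectors (queries), each of length C.
    layer_feats: list of M feature vectors (keys and values), each of length C.
    Returns (attended, weights): attended has N vectors of length C,
    weights has N rows of M attention weights that each sum to 1.
    """
    channels = len(fluid_feats[0])
    scale = 1.0 / math.sqrt(channels)  # assumed scaling, as in standard attention
    attended, weights = [], []
    for query in fluid_feats:
        # Correlation between this fluid feature and every layer feature.
        scores = [scale * sum(q * k for q, k in zip(query, key))
                  for key in layer_feats]
        row = softmax(scores)
        # Weighted sum of layer features for this query position.
        context = [sum(w * value[d] for w, value in zip(row, layer_feats))
                   for d in range(channels)]
        # Assumed residual connection: keep the original fluid feature.
        attended.append([q + c for q, c in zip(query, context)])
        weights.append(row)
    return attended, weights
```

Under these assumptions, a fluid feature that correlates strongly with a particular layer feature receives most of its context from that position, which is one way to read "concentrate on highly relevant regions".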
Pages: 309-320
Page count: 12