Joint image and feature adaptative attention-aware networks for cross-modality semantic segmentation

被引:3
作者
Zhong, Qihuang [1 ,2 ,3 ]
Zeng, Fanzhou [1 ]
Liao, Fei [1 ]
Liu, Juhua [2 ,3 ]
Du, Bo [3 ,4 ,5 ]
Shang, Jedi S. [6 ]
机构
[1] Wuhan Univ, Renmin Hosp, Dept Gastroenterol, Wuhan, Peoples R China
[2] Wuhan Univ, Sch Printing & Packaging, Wuhan, Peoples R China
[3] Wuhan Univ, Inst Artificial Intelligence, Natl Engn Res Ctr Multimedia Software, Wuhan, Peoples R China
[4] Wuhan Univ, Sch Comp Sci, Wuhan, Peoples R China
[5] Wuhan Univ, Hubei Key Lab Multimedia & Network Commun Engn, Wuhan, Peoples R China
[6] Thinvent Technol Co LTD, Nanchang, Jiangxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Domain adaptation; Attention; Cross-modality; Semantic segmentation; AUTOMATED SEGMENTATION; PATCH;
D O I
10.1007/s00521-021-06064-w
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning-based methods are widely used for the task of semantic segmentation in recent years. However, due to the difficulty and labor cost of collecting pixel-level annotations, it is hard to acquire sufficient training images for a certain imaging modality, which greatly hinders the performance of these methods. The intuitive solution to this issue is to train a pre-trained model on label-rich imaging modality (source domain) and then apply the pre-trained model to the label-poor imaging modality (target domain). Unsurprisingly, since the severe domain shift between different modalities, the pre-trained model would perform poorly on the target imaging modality. To this end, we propose a novel unsupervised domain adaptation framework, called Joint Image and Feature Adaptive Attention-aware Networks (JIFAAN), to alleviate the domain shift for cross-modality semantic segmentation. The proposed framework mainly consists of two procedures. The first procedure is image adaptation, which transforms the source domain images into target-like images using the adversarial learning with cycle-consistency constraint. For further bridging the gap between transformed images and target domain images, the second procedure employs feature adaptation to extract the domain-invariant features and thus aligns the distribution in feature space. In particular, we introduce an attention module in the feature adaptation to focus on noteworthy regions and generate attention-aware results. Lastly, we combine two procedures in an end-to-end manner. Experiments on two cross-modality semantic segmentation datasets demonstrate the effectiveness of our proposed framework. Specifically, JIFAAN surpasses the cutting-edge domain adaptation methods and achieves the state-of-the-art performance.
引用
收藏
页码:3665 / 3676
页数:12
相关论文
共 46 条
  • [1] Deep learning with non-medical training used for chest pathology identification
    Bar, Yaniv
    Diamant, Idit
    Wolf, Lior
    Greenspan, Hayit
    [J]. MEDICAL IMAGING 2015: COMPUTER-AIDED DIAGNOSIS, 2015, 9414
  • [2] Bergamo Alessandro., 2011, NIPS, P2088
  • [3] Unsupervised Pixel-Level Domain Adaptation with Generative Adversarial Networks
    Bousmalis, Konstantinos
    Silberman, Nathan
    Dohan, David
    Erhan, Dumitru
    Krishnan, Dilip
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 95 - 104
  • [4] Unsupervised Bidirectional Cross-Modality Adaptation via Deeply Synergistic Image and Feature Alignment for Medical Image Segmentation
    Chen, Cheng
    Dou, Qi
    Chen, Hao
    Qin, Jing
    Heng, Pheng Ann
    [J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2020, 39 (07) : 2494 - 2505
  • [5] Chen C, 2019, AAAI CONF ARTIF INTE, P865
  • [6] Semantic-Aware Generative Adversarial Nets for Unsupervised Domain Adaptation in Chest X-Ray Segmentation
    Chen, Cheng
    Dou, Qi
    Chen, Hao
    Heng, Pheng-Ann
    [J]. MACHINE LEARNING IN MEDICAL IMAGING: 9TH INTERNATIONAL WORKSHOP, MLMI 2018, 2018, 11046 : 143 - 151
  • [7] Attention to Scale: Scale-aware Semantic Image Segmentation
    Chen, Liang-Chieh
    Yang, Yi
    Wang, Jiang
    Xu, Wei
    Yuille, Alan L.
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3640 - 3649
  • [8] Anatomy-Regularized Representation Learning for Cross-Modality Medical Image Segmentation
    Chen, Xu
    Lian, Chunfeng
    Wang, Li
    Deng, Hannah
    Kuang, Tianshu
    Fung, Steve
    Gateno, Jaime
    Yap, Pew-Thian
    Xia, James J.
    Shen, Dinggang
    [J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2021, 40 (01) : 274 - 285
  • [9] The Cityscapes Dataset for Semantic Urban Scene Understanding
    Cordts, Marius
    Omran, Mohamed
    Ramos, Sebastian
    Rehfeld, Timo
    Enzweiler, Markus
    Benenson, Rodrigo
    Franke, Uwe
    Roth, Stefan
    Schiele, Bernt
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3213 - 3223
  • [10] Patch-based segmentation using expert priors: Application to hippocampus and ventricle segmentation
    Coupe, Pierrick
    Manjon, Jose V.
    Fonov, Vladimir
    Pruessner, Jens
    Robles, Montserrat
    Collins, D. Louis
    [J]. NEUROIMAGE, 2011, 54 (02) : 940 - 954