Joint image and feature adaptative attention-aware networks for cross-modality semantic segmentation

被引：3

作者：

Zhong, Qihuang ^{[1
,2
,3
]}

Zeng, Fanzhou ^{[1
]}

Liao, Fei ^{[1
]}

Liu, Juhua ^{[2
,3
]}

Du, Bo ^{[3
,4
,5
]}

Shang, Jedi S. ^{[6
]}

机构：

[1] Wuhan Univ, Renmin Hosp, Dept Gastroenterol, Wuhan, Peoples R China

[2] Wuhan Univ, Sch Printing & Packaging, Wuhan, Peoples R China

[3] Wuhan Univ, Inst Artificial Intelligence, Natl Engn Res Ctr Multimedia Software, Wuhan, Peoples R China

[4] Wuhan Univ, Sch Comp Sci, Wuhan, Peoples R China

[5] Wuhan Univ, Hubei Key Lab Multimedia & Network Commun Engn, Wuhan, Peoples R China

[6] Thinvent Technol Co LTD, Nanchang, Jiangxi, Peoples R China

来源：

NEURAL COMPUTING & APPLICATIONS | 2023年 / 35卷 / 05期

基金：

中国国家自然科学基金;

关键词：

Domain adaptation; Attention; Cross-modality; Semantic segmentation; AUTOMATED SEGMENTATION; PATCH;

D O I：

10.1007/s00521-021-06064-w

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deep learning-based methods are widely used for the task of semantic segmentation in recent years. However, due to the difficulty and labor cost of collecting pixel-level annotations, it is hard to acquire sufficient training images for a certain imaging modality, which greatly hinders the performance of these methods. The intuitive solution to this issue is to train a pre-trained model on label-rich imaging modality (source domain) and then apply the pre-trained model to the label-poor imaging modality (target domain). Unsurprisingly, since the severe domain shift between different modalities, the pre-trained model would perform poorly on the target imaging modality. To this end, we propose a novel unsupervised domain adaptation framework, called Joint Image and Feature Adaptive Attention-aware Networks (JIFAAN), to alleviate the domain shift for cross-modality semantic segmentation. The proposed framework mainly consists of two procedures. The first procedure is image adaptation, which transforms the source domain images into target-like images using the adversarial learning with cycle-consistency constraint. For further bridging the gap between transformed images and target domain images, the second procedure employs feature adaptation to extract the domain-invariant features and thus aligns the distribution in feature space. In particular, we introduce an attention module in the feature adaptation to focus on noteworthy regions and generate attention-aware results. Lastly, we combine two procedures in an end-to-end manner. Experiments on two cross-modality semantic segmentation datasets demonstrate the effectiveness of our proposed framework. Specifically, JIFAAN surpasses the cutting-edge domain adaptation methods and achieves the state-of-the-art performance.

引用

页码：3665 / 3676

页数：12

共 46 条

[1] Deep learning with non-medical training used for chest pathology identification
Bar, Yaniv
Diamant, Idit
Wolf, Lior
Greenspan, Hayit
[J]. MEDICAL IMAGING 2015: COMPUTER-AIDED DIAGNOSIS, 2015, 9414
[2] Bergamo Alessandro., 2011, NIPS, P2088
[3] Unsupervised Pixel-Level Domain Adaptation with Generative Adversarial Networks
Bousmalis, Konstantinos
Silberman, Nathan
Dohan, David
Erhan, Dumitru
Krishnan, Dilip
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 95 - 104
[4] Unsupervised Bidirectional Cross-Modality Adaptation via Deeply Synergistic Image and Feature Alignment for Medical Image Segmentation
Chen, Cheng
Dou, Qi
Chen, Hao
Qin, Jing
Heng, Pheng Ann
[J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2020, 39 (07) : 2494 - 2505
[5] Chen C, 2019, AAAI CONF ARTIF INTE, P865
[6] Semantic-Aware Generative Adversarial Nets for Unsupervised Domain Adaptation in Chest X-Ray Segmentation
Chen, Cheng
Dou, Qi
Chen, Hao
Heng, Pheng-Ann
[J]. MACHINE LEARNING IN MEDICAL IMAGING: 9TH INTERNATIONAL WORKSHOP, MLMI 2018, 2018, 11046 : 143 - 151
[7] Attention to Scale: Scale-aware Semantic Image Segmentation
Chen, Liang-Chieh
Yang, Yi
Wang, Jiang
Xu, Wei
Yuille, Alan L.
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3640 - 3649
[8] Anatomy-Regularized Representation Learning for Cross-Modality Medical Image Segmentation
Chen, Xu
Lian, Chunfeng
Wang, Li
Deng, Hannah
Kuang, Tianshu
Fung, Steve
Gateno, Jaime
Yap, Pew-Thian
Xia, James J.
Shen, Dinggang
[J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2021, 40 (01) : 274 - 285
[9] The Cityscapes Dataset for Semantic Urban Scene Understanding
Cordts, Marius
Omran, Mohamed
Ramos, Sebastian
Rehfeld, Timo
Enzweiler, Markus
Benenson, Rodrigo
Franke, Uwe
Roth, Stefan
Schiele, Bernt
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3213 - 3223
[10] Patch-based segmentation using expert priors: Application to hippocampus and ventricle segmentation
Coupe, Pierrick
Manjon, Jose V.
Fonov, Vladimir
Pruessner, Jens
Robles, Montserrat
Collins, D. Louis
[J]. NEUROIMAGE, 2011, 54 (02) : 940 - 954

← 1 2 3 4 5 →