Cross-modal Unsupervised Domain Adaptation for 3D Semantic Segmentation via Bidirectional Fusion-then-Distillation

被引：1

作者：

Wu, Yao ^{[1
]}

Xing, Mingwei ^{[2
]}

Zhang, Yachao ^{[3
]}

Xie, Yuan ^{[4
,5
]}

Fan, Jianping ^{[6
]}

Shi, Zhongchao ^{[6
]}

Qu, Yanyun ^{[2
]}

机构：

[1] Xiamen Univ, Sch Informat, Xiamen, Peoples R China

[2] Xiamen Univ, Inst Artificial Intelligence, Xiamen, Peoples R China

[3] Tsinghua Univ, Shenzhen, Peoples R China

[4] East China Normal Univ, Shanghai, Peoples R China

[5] East China Normal Univ, Chongqing Inst, Chongqing, Peoples R China

[6] Lenovo Res, Beijing, Peoples R China

来源：

PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023 | 2023年

基金：

中国国家自然科学基金; 中国博士后科学基金;

关键词：

3D semantic segmentation; Unsupervised domain adaptation;

D O I：

10.1145/3581783.3612013

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Cross-modal Unsupervised Domain Adaptation (UDA) becomes a research hotspot because it reduces the laborious annotation of target domain samples. Existing methods only mutually mimic the outputs of cross-modality in each domain, which enforces the class probability distribution agreeable in different domains. However, these methods ignore the complementarity brought by the modality fusion representation in cross-modal learning. In this paper, we propose a cross-modal UDA method for 3D semantic segmentation via Bidirectional Fusion-then-Distillation, named BFtD-xMUDA, which explores cross-modal fusion in UDA and realizes distribution consistency between outputs of two domains not only for 2D image and 3D point cloud but also for 2D/3D and fusion. Our method contains three significant components: Model-agnostic Feature Fusion Module (MFFM), Bidirectional Distillation (B-Distill), and Cross-modal Debiased Pseudo-Labeling (xDPL). MFFM is employed to generate cross-modal fusion features for establishing a latent space, which enforces maximum correlation and complementarity between two heterogeneous modalities. B-Distill is introduced to exploit bidirectional knowledge distillation which includes cross-modality and cross-domain fusion distillation, and well-achieving domain-modality alignment. xDPL is designed to model the uncertainty of pseudo-labels by self-training scheme. Extensive experimental results demonstrate that our method outperforms state-of-the-art competitors in several adaptation scenarios.

引用

页码：490 / 498

页数：9

共 44 条

[21] Unsupervised domain adaptation multi-level adversarial network for semantic segmentation based on multi-modal features
Wang Z.
Bu S.
Huang W.
Zheng Y.
Wu Q.
Chang H.
Zhang X.
Tongxin Xuebao/Journal on Communications, 2022, 43 (12): : 157 - 171
[22] Unsupervised Domain Adaptation in Medical Image Segmentation via Fourier Feature Decoupling and Multi-teacher Distillation
Hu, Wei
Xu, Qiaozhi
Qi, Xuanhao
Yin, Yanjun
Zhi, Min
Lian, Zhe
Yang, Na
Duan, Wentao
Yu, Lei
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT VI, ICIC 2024, 2024, 14867 : 98 - 110
[23] PLS: UNSUPERVISED DOMAIN ADAPTATION FOR 3D OBJECT DETECTION VIA PSEUDO-LABEL SIZES
Chen, Shijie
Wang, Rongquan
Li, Xin
Wu, Yuchen
Liu, Haizhuang
Chen, Jiansheng
Ma, Huimin
2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 6370 - 6374
[24] UDA-KB: Unsupervised Domain Adaptation RGB-Thermal Semantic Segmentation via Knowledge Bridge
Guo, Yuanhui
Ni, Rongrong
Yu, Zhitao
Yang, Biao
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT X, 2025, 15040 : 61 - 74
[25] Unsupervised Domain Adaptation for Vestibular Schwannoma and Cochlea Segmentation via Semi-supervised Learning and Label Fusion
Liu, Han
Fan, Yubo
Cui, Can
Su, Dingjie
McNeil, Andrew
Dawant, Benoit M.
BRAINLESION: GLIOMA, MULTIPLE SCLEROSIS, STROKE AND TRAUMATIC BRAIN INJURIES, BRAINLES 2021, PT II, 2022, 12963 : 529 - 539
[26] Unsupervised Bidirectional Cross-Modality Adaptation via Deeply Synergistic Image and Feature Alignment for Medical Image Segmentation
Chen, Cheng
Dou, Qi
Chen, Hao
Qin, Jing
Heng, Pheng Ann
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2020, 39 (07) : 2494 - 2505
[27] 3D Domain Adaptive Instance Segmentation via Cyclic Segmentation GANs
Lauenburg, Leander
Lin, Zudi
Zhang, Ruihan
dos Santos, Marcia
Huang, Siyu
Arganda-Carreras, Ignacio
Boyden, Edward S.
Pfister, Hanspeter
Wei, Donglai
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2023, 27 (08) : 4018 - 4027
[28] A Structure-Aware Framework of Unsupervised Cross-Modality Domain Adaptation via Frequency and Spatial Knowledge Distillation
Liu, Shaolei
Yin, Siqi
Qu, Linhao
Wang, Manning
Song, Zhijian
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2023, 42 (12) : 3919 - 3931
[29] ACCURATE 3D KIDNEY SEGMENTATION USING UNSUPERVISED DOMAIN TRANSLATION AND ADVERSARIAL NETWORKS
Zeng, Wankang
Fan, Wenkang
Chen, Rong
Zheng, Zhuohui
Zheng, Song
Chen, Jianhui
Liu, Rong
Zeng, Qiang
Liu, Zengqin
Chen, Yinran
Luo, Xiongbiao
2021 IEEE 18TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI), 2021, : 598 - 602
[30] AdaptDiff: Cross-Modality Domain Adaptation via Weak Conditional Semantic Diffusion for Retinal Vessel Segmentation
Hu, Dewei
Li, Hao
Liu, Han
Wang, Jiacheng
Yao, Xing
Lu, Daiwei
Oguz, Ipek
SIMULATION AND SYNTHESIS IN MEDICAL IMAGING, SASHIMI 2024, 2025, 15187 : 13 - 23

← 1 2 3 4 5 →