Cross-modal Unsupervised Domain Adaptation for 3D Semantic Segmentation via Bidirectional Fusion-then-Distillation

被引:1
作者
Wu, Yao [1 ]
Xing, Mingwei [2 ]
Zhang, Yachao [3 ]
Xie, Yuan [4 ,5 ]
Fan, Jianping [6 ]
Shi, Zhongchao [6 ]
Qu, Yanyun [2 ]
机构
[1] Xiamen Univ, Sch Informat, Xiamen, Peoples R China
[2] Xiamen Univ, Inst Artificial Intelligence, Xiamen, Peoples R China
[3] Tsinghua Univ, Shenzhen, Peoples R China
[4] East China Normal Univ, Shanghai, Peoples R China
[5] East China Normal Univ, Chongqing Inst, Chongqing, Peoples R China
[6] Lenovo Res, Beijing, Peoples R China
来源
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023 | 2023年
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
3D semantic segmentation; Unsupervised domain adaptation;
D O I
10.1145/3581783.3612013
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cross-modal Unsupervised Domain Adaptation (UDA) becomes a research hotspot because it reduces the laborious annotation of target domain samples. Existing methods only mutually mimic the outputs of cross-modality in each domain, which enforces the class probability distribution agreeable in different domains. However, these methods ignore the complementarity brought by the modality fusion representation in cross-modal learning. In this paper, we propose a cross-modal UDA method for 3D semantic segmentation via Bidirectional Fusion-then-Distillation, named BFtD-xMUDA, which explores cross-modal fusion in UDA and realizes distribution consistency between outputs of two domains not only for 2D image and 3D point cloud but also for 2D/3D and fusion. Our method contains three significant components: Model-agnostic Feature Fusion Module (MFFM), Bidirectional Distillation (B-Distill), and Cross-modal Debiased Pseudo-Labeling (xDPL). MFFM is employed to generate cross-modal fusion features for establishing a latent space, which enforces maximum correlation and complementarity between two heterogeneous modalities. B-Distill is introduced to exploit bidirectional knowledge distillation which includes cross-modality and cross-domain fusion distillation, and well-achieving domain-modality alignment. xDPL is designed to model the uncertainty of pseudo-labels by self-training scheme. Extensive experimental results demonstrate that our method outperforms state-of-the-art competitors in several adaptation scenarios.
引用
收藏
页码:490 / 498
页数:9
相关论文
共 44 条
  • [21] Unsupervised domain adaptation multi-level adversarial network for semantic segmentation based on multi-modal features
    Wang Z.
    Bu S.
    Huang W.
    Zheng Y.
    Wu Q.
    Chang H.
    Zhang X.
    Tongxin Xuebao/Journal on Communications, 2022, 43 (12): : 157 - 171
  • [22] Unsupervised Domain Adaptation in Medical Image Segmentation via Fourier Feature Decoupling and Multi-teacher Distillation
    Hu, Wei
    Xu, Qiaozhi
    Qi, Xuanhao
    Yin, Yanjun
    Zhi, Min
    Lian, Zhe
    Yang, Na
    Duan, Wentao
    Yu, Lei
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT VI, ICIC 2024, 2024, 14867 : 98 - 110
  • [23] PLS: UNSUPERVISED DOMAIN ADAPTATION FOR 3D OBJECT DETECTION VIA PSEUDO-LABEL SIZES
    Chen, Shijie
    Wang, Rongquan
    Li, Xin
    Wu, Yuchen
    Liu, Haizhuang
    Chen, Jiansheng
    Ma, Huimin
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 6370 - 6374
  • [24] UDA-KB: Unsupervised Domain Adaptation RGB-Thermal Semantic Segmentation via Knowledge Bridge
    Guo, Yuanhui
    Ni, Rongrong
    Yu, Zhitao
    Yang, Biao
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT X, 2025, 15040 : 61 - 74
  • [25] Unsupervised Domain Adaptation for Vestibular Schwannoma and Cochlea Segmentation via Semi-supervised Learning and Label Fusion
    Liu, Han
    Fan, Yubo
    Cui, Can
    Su, Dingjie
    McNeil, Andrew
    Dawant, Benoit M.
    BRAINLESION: GLIOMA, MULTIPLE SCLEROSIS, STROKE AND TRAUMATIC BRAIN INJURIES, BRAINLES 2021, PT II, 2022, 12963 : 529 - 539
  • [26] Unsupervised Bidirectional Cross-Modality Adaptation via Deeply Synergistic Image and Feature Alignment for Medical Image Segmentation
    Chen, Cheng
    Dou, Qi
    Chen, Hao
    Qin, Jing
    Heng, Pheng Ann
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2020, 39 (07) : 2494 - 2505
  • [27] 3D Domain Adaptive Instance Segmentation via Cyclic Segmentation GANs
    Lauenburg, Leander
    Lin, Zudi
    Zhang, Ruihan
    dos Santos, Marcia
    Huang, Siyu
    Arganda-Carreras, Ignacio
    Boyden, Edward S.
    Pfister, Hanspeter
    Wei, Donglai
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2023, 27 (08) : 4018 - 4027
  • [28] A Structure-Aware Framework of Unsupervised Cross-Modality Domain Adaptation via Frequency and Spatial Knowledge Distillation
    Liu, Shaolei
    Yin, Siqi
    Qu, Linhao
    Wang, Manning
    Song, Zhijian
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2023, 42 (12) : 3919 - 3931
  • [29] ACCURATE 3D KIDNEY SEGMENTATION USING UNSUPERVISED DOMAIN TRANSLATION AND ADVERSARIAL NETWORKS
    Zeng, Wankang
    Fan, Wenkang
    Chen, Rong
    Zheng, Zhuohui
    Zheng, Song
    Chen, Jianhui
    Liu, Rong
    Zeng, Qiang
    Liu, Zengqin
    Chen, Yinran
    Luo, Xiongbiao
    2021 IEEE 18TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI), 2021, : 598 - 602
  • [30] AdaptDiff: Cross-Modality Domain Adaptation via Weak Conditional Semantic Diffusion for Retinal Vessel Segmentation
    Hu, Dewei
    Li, Hao
    Liu, Han
    Wang, Jiacheng
    Yao, Xing
    Lu, Daiwei
    Oguz, Ipek
    SIMULATION AND SYNTHESIS IN MEDICAL IMAGING, SASHIMI 2024, 2025, 15187 : 13 - 23