SAFENet: Semantic-Aware Feature Enhancement Network for unsupervised cross-domain road scene segmentation

被引:0
|
作者
Ren, Dexin [1 ,2 ]
Li, Minxian [1 ,2 ]
Wang, Shidong [3 ]
Ren, Mingwu [1 ,2 ]
Zhang, Haofeng [1 ,2 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China
[2] Nanjing Univ Sci & Technol, State Key Lab Intelligent Mfg Adv Construct Machin, Nanjing 210094, Peoples R China
[3] Newcastle Univ, Sch Engn, Newcastle Upon Tyne NE1 7RU, England
基金
中国国家自然科学基金;
关键词
Unsupervised domain adaptation; Semantic segmentation; Semantic-Aware Feature Enhancement; Adaptive instance normalization; Knowledge transfer;
D O I
10.1016/j.imavis.2024.105318
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Unsupervised cross-domain road scene segmentation has attracted substantial interest because of its capability to perform segmentation on new and unlabeled domains, thereby reducing the dependence on expensive manual annotations. This is achieved by leveraging networks trained on labeled source domains to classify images on unlabeled target domains. Conventional techniques usually use adversarial networks to align inputs from the source and the target in either of their domains. However, these approaches often fall short in effectively integrating information from both domains due to Alignment in each space usually leads to bias problems during feature learning. To overcome these limitations and enhance cross-domain interaction while mitigating overfitting to the source domain, we introduce a novel framework called Semantic-Aware Feature Enhancement Network (SAFENet) for Unsupervised Cross-domain Road Scene Segmentation. SAFENet incorporates the Semantic-Aware Enhancement (SAE) module to amplify the importance of class information in segmentation tasks and uses the semantic space as anew domain to guide the alignment of the source and target domains. Additionally, we integrate Adaptive Instance Normalization with Momentum (AdaINM) techniques, which convert the source domain image style to the target domain image style, thereby reducing the adverse effects of source domain overfitting on target domain segmentation performance. Moreover, SAFENet employs a Knowledge Transfer (KT) module to optimize network architecture, enhancing computational efficiency during testing while maintaining the robust inference capabilities developed during training. To further improve the segmentation performance, we further employ Curriculum Learning, a self- training mechanism that uses pseudo-labels derived from the target domain to iteratively refine the network. Comprehensive experiments on three well-known datasets, "Synthia -> Cityscapes"and "GTA5 -> Cityscapes", demonstrate the superior performance of our method. In-depth examinations and ablation studies verify the efficacy of each module within the proposed method.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] HSNet: An Intelligent Hierarchical Semantic-Aware Network System for Real-Time Semantic Segmentation
    Peng, Xin
    Cheng, Jieren
    Tang, Xiangyan
    Deng, Ziqi
    Tu, Wenxuan
    Xiong, Neal
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, 54 (07): : 4318 - 4330
  • [32] Multi-Scale Depth-Aware Unsupervised Domain Adaption in Semantic Segmentation
    Xing, Congying
    Zhang, Lefei
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [33] Self-Ensembling GAN for Cross-Domain Semantic Segmentation
    Xu, Yonghao
    He, Fengxiang
    Du, Bo
    Tao, Dacheng
    Zhang, Liangpei
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 7837 - 7850
  • [34] Prototypical Bidirectional Adaptation and Learning for Cross-Domain Semantic Segmentation
    Ren, Qinghua
    Mao, Qirong
    Lu, Shijian
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 501 - 513
  • [35] CCANet: Cross-Modality Comprehensive Feature Aggregation Network for Indoor Scene Semantic Segmentation
    Zhang, Zihao
    Yang, Yale
    Hou, Huifang
    Meng, Fanman
    Zhang, Fan
    Xie, Kangzhan
    Zhuang, Chunsheng
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2025, 17 (02) : 366 - 378
  • [36] Unsupervised domain adaptation for mobile semantic segmentation based on cycle consistency and feature alignment
    Toldo, Marco
    Michieli, Umberto
    Agresti, Gianluca
    Zanuttigh, Pietro
    IMAGE AND VISION COMPUTING, 2020, 95
  • [37] Weakly-Supervised Cross-Domain Road Scene Segmentation via Multi-Level Curriculum Adaptation
    Lv, Fengmao
    Lin, Guosheng
    Liu, Peng
    Yang, Guowu
    Pan, Sinno Jialin
    Duan, Lixin
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (09) : 3493 - 3503
  • [38] PANDA: A Polarized Attention Network for Enhanced Unsupervised Domain Adaptation in Semantic Segmentation
    Kao, Chiao-Wen
    Chang, Wei-Ling
    Lee, Chun-Chieh
    Fan, Kuo-Chin
    ELECTRONICS, 2024, 13 (21)
  • [39] Enhanced Target Domain Representation Based Unsupervised Cross-Domain Medical Image Segmentation
    Liu, Kai
    Lu, Runuo
    Zheng, Xiaorou
    Dong, Shoubin
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2024, 37 (09): : 755 - 769
  • [40] Lightweight and Efficient Convolutional Neural Network for Road Scene Semantic Segmentation
    Kherraki, Amine
    Maqbool, Muaz
    El Ouazzani, Rajae
    2022 IEEE 18TH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTER COMMUNICATION AND PROCESSING, ICCP, 2022, : 135 - 139