CAFA: Cross-Modal Attentive Feature Alignment for Cross-Domain Urban Scene Segmentation

被引：1

作者：

Liu, Peng ^{[1
]}

Ge, Yanqi ^{[2
]}

Duan, Lixin ^{[1
,3
]}

Li, Wen ^{[2
]}

Lv, Fengmao ^{[4
,5
]}

机构：

[1] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 611731, Peoples R China

[2] Univ Elect Sci & Technol China, Shenzhen Inst Adv Study, Shenzhen 518110, Peoples R China

[3] Univ Elect Sci & Technol China, Sichuan Prov Peoples Hosp, Chengdu 610032, Peoples R China

[4] Southwest Jiaotong Univ, Sch Comp & Artificial Intelligence, Chengdu 611756, Peoples R China

[5] Minist Educ, Engn Res Ctr Sustainable Urban Intelligent Transp, Chengdu 611756, Peoples R China

来源：

IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS | 2024年 / 20卷 / 10期

基金：

中国国家自然科学基金;

关键词：

Task analysis; Semantic segmentation; Feature extraction; Training; Transformers; Estimation; Adaptation models; Autonomous vehicles; domain adaptation; semantic segmentation;

D O I：

10.1109/TII.2024.3412006

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Autonomous driving systems rely heavily on semantic segmentation models for accurate and safe decision-making. High segmentation performance in real-world urban scenes is crucial for autonomous vehicles, while substantial pixel-level labels are required during model training. Unsupervised domain adaptation (UDA) techniques are widely used to adapt the segmentation model trained on the synthetic data (i.e., source domain) to the real-world data (i.e., target domain) since obtaining pixel-level annotations is fairly easy in the synthetic environment. Recently, increasing UDA approaches promote cross-domain semantic segmentation (CDSS) by fusing the depth information into the RGB features. However, feature fusion does not necessarily eliminate the domain-specific components in the RGB features, which can result in the features still being influenced by domain-specific information. To address this, we propose a novel cross-modal attentive feature alignment (CAFA) framework for CDSS, which provides an explicit perspective of using depth information to align the main backbone RGB features of both domains in a nonadversarial manner. In particular, considering that the depth modality is less affected by the domain gap, we employ depth as an intermediate modality and align the RGB features by attending RGB features to the depth modality through constructing an auxiliary multimodal segmentation task. The state-of-the-art performance of our CAFA can be achieved on benchmark tasks, such as Synthia -> Cityscapes and grand theft auto (GTA) -> Cityscapes.

引用

页码：11666 / 11675

页数：10

共 50 条

[21] Preserving Label-Related Domain-Specific Information for Cross-Domain Semantic Segmentation
Liao, Muxin
Tian, Shishun
Zhang, Yuhang
Hua, Guoguang
Zou, Wenbin
Li, Xia
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (10) : 14917 - 14931
[22] CONTRAST UNCERTAINTY DOMAIN ALIGNMENT FOR CROSS-DOMAIN PANCREATIC IMAGE SEGMENTATION
Fan, Ligang
Bian, Yun
Zhu, Weifang
Shi, Fei
Chen, Xinjian
Shao, Chengwei
Xiang, Dehui
2023 IEEE 20TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI, 2023,
[23] Multisource Compensation Network for Remote Sensing Cross-Domain Scene Classification
Lu, Xiaoqiang
Gong, Tengfei
Zheng, Xiangtao
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2020, 58 (04): : 2504 - 2515
[24] Cross-Domain Rumor Detection based on Dual-Modal Domain Alignment
Liu, Danni
Liu, Bo
Chen, Yida
Wu, Wanmeng
Cao, Jiuxin
Hou, Yiwen
2024 9TH INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING, ICSIP, 2024, : 544 - 548
[25] Cross-Domain Multilevel Feature Adaptive Alignment R-CNN for Insulator Defect Detection in Transmission Lines
Wang, Yaru
Qu, Zhuo
Hu, Zhedong
Yang, Chunwang
Huang, Xiaoguang
Zhao, Zhenbing
Zhai, Yongjie
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2025, 74
[26] Cross-Modal Progressive Comprehension for Referring Segmentation
Liu, Si
Hui, Tianrui
Huang, Shaofei
Wei, Yunchao
Li, Bo
Li, Guanbin
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (09) : 4761 - 4775
[27] Feature and Joint Distribution Migration Alignment Method for Cross-Domain Fault Diagnosis of Rotating Machinery
Zhang, Yazhou
Zhao, Xiaoqiang
Xu, Rongrong
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2025, 74
[28] Unsupervised Cross-Domain Fault Diagnosis Using Feature Representation Alignment Networks for Rotating Machinery
Chen, Jiahong
Wang, Jing
Zhu, Jianxin
Lee, Tong Heng
de Silva, Clarence W.
IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2021, 26 (05) : 2770 - 2781
[29] Category-Level Assignment for Cross-Domain Semantic Segmentation in Remote Sensing Images
Ni, Huan
Liu, Qingshan
Guan, Haiyan
Tang, Hong
Chanussot, Jocelyn
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
[30] Cross-Domain Fault Diagnosis of Rotating Machinery Using Discriminative Feature Attention Network
Jang, Gye-Bong
Kim, Jin-Young
Cho, Sung-Bae
IEEE ACCESS, 2021, 9 : 99781 - 99793

← 1 2 3 4 5 →