PIG: Prompt Images Guidance for Night-Time Scene Parsing

Cited by: 2

Authors:

Xie, Zhifeng [1,2,3]

Qiu, Rui [1]

Wang, Sen [4,5]

Tan, Xin [4,5]

Xie, Yuan [4,5]

Ma, Lizhuang [4,6]

Affiliations:

[1] Shanghai Univ, Dept Film & Televis Engn, Shanghai 200072, Peoples R China

[2] Shanghai Key Lab Comp Software Testing & Evaluating, Shanghai 200072, Peoples R China

[3] Shanghai Engn Res Ctr Mot Picture Special Effects, Shanghai 200072, Peoples R China

[4] East China Normal Univ, Sch Comp Sci & Technol, Shanghai 200062, Peoples R China

[5] East China Normal Univ, Chongqing Inst, Chongqing 401120, Peoples R China

[6] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai 200240, Peoples R China

Source:

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2024 / Vol. 33

Funding:

National Natural Science Foundation of China;

Keywords:

Training; Accuracy; Task analysis; Adaptation models; Semantics; Motion pictures; Knowledge engineering; Night-time vision; scene parsing; unsupervised domain adaptation; prompt learning;

DOI:

10.1109/TIP.2024.3415963

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Night-time scene parsing aims to extract pixel-level semantic information in night images, aiding downstream tasks in understanding scene object distribution. Due to limited labeled night image datasets, unsupervised domain adaptation (UDA) has become the predominant method for studying night scenes. UDA typically relies on day-night image pairs to guide adaptation, but this approach hampers dataset construction and restricts generalization across night scenes in different datasets. Moreover, UDA, focusing on network architecture and training strategies, faces difficulties in handling classes with few domain similarities. In this paper, we leverage Prompt Images Guidance (PIG) to enhance UDA with supplementary night knowledge. We propose a Night-Focused Network (NFNet) to learn night-specific features from both target domain images and prompt images. To generate high-quality pseudo-labels, we propose Pseudo-label Fusion via Domain Similarity Guidance (FDSG). Classes with fewer domain similarities are predicted by NFNet, which excels in parsing night features, while classes with more domain similarities are predicted by UDA, which has rich labeled semantics. Additionally, we propose two data augmentation strategies: the Prompt Mixture Strategy (PMS) and the Alternate Mask Strategy (AMS), aimed at mitigating the overfitting of the NFNet to a few prompt images. We conduct extensive experiments on four night-time datasets: NightCity, NightCity+, Dark Zurich, and ACDC. The results indicate that utilizing PIG can enhance the parsing accuracy of UDA. The code is available at https://github.com/qiurui4shu/PIG.
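
The class-wise pseudo-label fusion described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' FDSG implementation: the function name `fuse_pseudo_labels`, the `(C, H, W)` logit layout, the hand-supplied set `low_similarity_classes`, and the rule "take the night-focused prediction wherever it names a low-similarity class" are all assumptions made for illustration. The abstract does not specify how domain similarity is measured or how conflicts are resolved.

```python
import numpy as np


def fuse_pseudo_labels(uda_logits, nf_logits, low_similarity_classes):
    """Fuse two per-pixel predictions into one pseudo-label map.

    uda_logits, nf_logits: arrays of shape (C, H, W), one score per class.
    low_similarity_classes: class ids assumed to differ strongly between
        day and night (hypothetical input; chosen by hand in this sketch).
    Returns an (H, W) integer array of pseudo-labels.
    """
    uda_pred = uda_logits.argmax(axis=0)  # prediction of the UDA model
    nf_pred = nf_logits.argmax(axis=0)    # prediction of the night-focused model

    # Assumed rule: trust the night-focused model only where it predicts
    # one of the low-similarity classes; otherwise keep the UDA prediction.
    use_nf = np.isin(nf_pred, list(low_similarity_classes))
    return np.where(use_nf, nf_pred, uda_pred)


# Toy example: 3 classes, a 1x2 image, class 2 treated as low-similarity.
uda = np.zeros((3, 1, 2))
uda[0, 0, 0] = 1.0  # UDA model: pixel 0 -> class 0
uda[1, 0, 1] = 1.0  # UDA model: pixel 1 -> class 1
nf = np.zeros((3, 1, 2))
nf[2, 0, 0] = 1.0   # night-focused model: pixel 0 -> class 2
nf[0, 0, 1] = 1.0   # night-focused model: pixel 1 -> class 0
fused = fuse_pseudo_labels(uda, nf, {2})
```

In the toy example the first pixel takes the night-focused prediction because it names the low-similarity class, and the second pixel keeps the UDA prediction. The official code linked in the abstract is the place to check the actual fusion rule.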


Pages: 3921-3934

Page count: 14
