PIG: Prompt Images Guidance for Night-Time Scene Parsing

Cited by: 2
Authors
Xie, Zhifeng [1, 2, 3]
Qiu, Rui [1]
Wang, Sen [4, 5]
Tan, Xin [4, 5]
Xie, Yuan [4, 5]
Ma, Lizhuang [4, 6]
Affiliations
[1] Shanghai Univ, Dept Film & Televis Engn, Shanghai 200072, Peoples R China
[2] Shanghai Key Lab Comp Software Testing & Evaluatin, Shanghai 200072, Peoples R China
[3] Shanghai Engn Res Ctr Mot Picture Special Effects, Shanghai 200072, Peoples R China
[4] East China Normal Univ, Sch Comp Sci & Technol, Shanghai 200062, Peoples R China
[5] East China Normal Univ, Chongqing Inst, Chongqing 401120, Peoples R China
[6] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai 200240, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Training; Accuracy; Task analysis; Adaptation models; Semantics; Motion pictures; Knowledge engineering; Night-time vision; scene parsing; unsupervised domain adaptation; prompt learning;
DOI
10.1109/TIP.2024.3415963
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Night-time scene parsing aims to extract pixel-level semantic information from night images, helping downstream tasks understand how objects are distributed in a scene. Because labeled night image datasets are limited, unsupervised domain adaptation (UDA) has become the predominant method for studying night scenes. UDA typically relies on paired day-night images to guide adaptation, but this requirement complicates dataset construction and restricts generalization across night scenes in different datasets. Moreover, UDA, which focuses on network architecture and training strategies, has difficulty handling classes with few domain similarities. In this paper, we leverage Prompt Images Guidance (PIG) to enhance UDA with supplementary night knowledge. We propose a Night-Focused Network (NFNet) to learn night-specific features from both target domain images and prompt images. To generate high-quality pseudo-labels, we propose Pseudo-label Fusion via Domain Similarity Guidance (FDSG). Classes with fewer domain similarities are predicted by NFNet, which excels in parsing night features, while classes with more domain similarities are predicted by UDA, which has rich labeled semantics. Additionally, we propose two data augmentation strategies, the Prompt Mixture Strategy (PMS) and the Alternate Mask Strategy (AMS), aimed at mitigating the overfitting of NFNet to a few prompt images. We conduct extensive experiments on four night-time datasets: NightCity, NightCity+, Dark Zurich, and ACDC. The results indicate that utilizing PIG can enhance the parsing accuracy of UDA. The code is available at https://github.com/qiurui4shu/PIG.
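The class-wise routing described for FDSG can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `fuse_pseudo_labels`, the per-pixel rule, and the precomputed set `low_similarity_classes` are all assumptions made for illustration, and the abstract does not say how domain similarity is measured. The sketch assumes that a pixel takes NFNet's label when NFNet predicts a low-similarity class, and the UDA label otherwise.

```python
from typing import List, Set

LabelMap = List[List[int]]  # H x W grid of class indices


def fuse_pseudo_labels(
    uda_pred: LabelMap,
    nfnet_pred: LabelMap,
    low_similarity_classes: Set[int],
) -> LabelMap:
    """Fuse two label maps pixel by pixel (hypothetical rule, see lead-in).

    If NFNet predicts a class from low_similarity_classes at a pixel,
    keep NFNet's label; otherwise keep the UDA model's label.
    """
    if len(uda_pred) != len(nfnet_pred):
        raise ValueError("label maps must have the same height")
    fused: LabelMap = []
    for uda_row, nf_row in zip(uda_pred, nfnet_pred):
        if len(uda_row) != len(nf_row):
            raise ValueError("label maps must have the same width")
        fused.append(
            [nf if nf in low_similarity_classes else uda
             for uda, nf in zip(uda_row, nf_row)]
        )
    return fused


# Toy 2 x 3 label maps; class 3 is assumed to be a low-similarity class.
uda = [[0, 0, 1], [2, 2, 1]]
nfnet = [[0, 3, 3], [2, 1, 3]]
fused = fuse_pseudo_labels(uda, nfnet, low_similarity_classes={3})
print(fused)  # [[0, 3, 3], [2, 2, 3]]
```

In this toy case only the pixels where NFNet predicts class 3 are overwritten; everywhere else the UDA labels are kept, including the pixel where NFNet disagrees with a class outside the low-similarity set.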
Pages: 3921-3934
Page count: 14
Related papers
28 in total
  • [1] Boosting Night-Time Scene Parsing With Learnable Frequency
    Xie, Zhifeng
    Wang, Sen
    Xu, Ke
    Zhang, Zhizhong
    Tan, Xin
    Xie, Yuan
    Ma, Lizhuang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 2386 - 2398
  • [2] Night-Time Scene Parsing With a Large Real Dataset
    Tan, Xin
    Xu, Ke
    Cao, Ying
    Zhang, Yiheng
    Ma, Lizhuang
    Lau, Rynson W. H.
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 9085 - 9098
  • [3] Lightweight and Efficient Multimodal Prompt Injection Network for Scene Parsing of Remote Sensing Scene Images
    Li, Yangzhen
    Zhou, Wujie
    Meng, Jiajun
    Yan, Weiqing
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [4] EFRNet: Efficient Feature Reconstructing Network for Real-Time Scene Parsing
    Li, Xin
    Yang, Fan
    Luo, Ao
    Jiao, Zhicheng
    Cheng, Hong
    Liu, Zicheng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 2852 - 2865
  • [5] RTLNet: Recursive Triple-Path Learning Network for Scene Parsing of RGB-D Images
    Yue, Yuchun
    Zhou, Wujie
    Lei, Jingsheng
    Yu, Lu
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 429 - 433
  • [6] Adjacent Bi-Hierarchical Network for Scene Parsing of Remote Sensing Images
    Ma, Jiabao
    Zhou, Wujie
    Lei, Jingsheng
    Yu, Lu
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [7] A Real-Time Scene Parsing Network for Autonomous Maritime Transportation
    Zhou, Rundong
    Gao, Yulong
    Wang, Yang
    Xie, Xingxiang
    Zhao, Xiongwei
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [9] Improving Night-Time Pedestrian Retrieval With Distribution Alignment and Contextual Distance
    Ye, Mang
    Cheng, Yi
    Lan, Xiangyuan
    Zhu, Hongyuan
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2020, 16 (01) : 615 - 624
  • [10] CEGFNet: Common Extraction and Gate Fusion Network for Scene Parsing of Remote Sensing Images
    Zhou, Wujie
    Jin, Jianhui
    Lei, Jingsheng
    Hwang, Jenq-Neng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60