Spatial Continuity and Nonequal Importance in Salient Object Detection With Image-Category Supervision

被引:0
|
作者
Wu, Zhihao [1 ]
Liu, Chengliang [1 ]
Wen, Jie [1 ]
Xu, Yong [1 ,2 ]
Yang, Jian [3 ]
Li, Xuelong [4 ]
机构
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Shenzhen 518055, Peoples R China
[2] Pengcheng Lab, Shenzhen 518055, Peoples R China
[3] Nanjing Univ Sci & Technol, Dept Comp Sci & Engn, Nanjing 210094, Peoples R China
[4] Northwestern Polytech Univ, Sch Artificial Intelligence OPt & Elect iOPEN, Xian 710072, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Noise; Detectors; Transformers; Robustness; Object detection; Training; Feature extraction; Benchmark; robustness; salient object detection; weak supervision;
D O I
10.1109/TNNLS.2024.3436519
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Due to the inefficiency of pixel-level annotations, weakly supervised salient object detection with image-category labels (WSSOD) has been receiving increasing attention. Previous works usually endeavor to generate high-quality pseudolabels to train the detectors in a fully supervised manner. However, we find that the detection performance is often limited by two types of noise contained in pseudolabels: 1) holes inside the object or at the edge and outliers in the background and 2) missing object portions and redundant surrounding regions. To mitigate the adverse effects caused by them, we propose local pixel correction (LPC) and key pixel attention (KPA), respectively, based on two key properties of desirable pseudolabels: 1) spatial continuity, meaning an object region consists of a cluster of adjacent points; and 2) nonequal importance, meaning pixels have different importance for training. Specifically, LPC fills holes and filters out outliers based on summary statistics of the neighborhood as well as its size. KPA directs the focus of training toward ambiguous pixels in multiple pseudolabels to discover more accurate saliency cues. To evaluate the effectiveness of our method, we design a simple yet strong baseline we call weakly supervised saliency detector with Transformer (WSSDT) and unify the proposed modules into WSSDT. Extensive experiments on five datasets demonstrate that our method significantly improves the baseline and outperforms all existing congeneric methods. Moreover, we establish the first benchmark to evaluate WSSOD robustness. The results show that our method can improve detection robustness as well. The code and robustness benchmark are available at https://github.com/Horatio9702/SCNI.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Trigonometric feature learning for RGBD and RGBT image salient object detection
    Huang, Liming
    Gong, Aojun
    KNOWLEDGE-BASED SYSTEMS, 2025, 310
  • [32] APNet: Adversarial Learning Assistance and Perceived Importance Fusion Network for All-Day RGB-T Salient Object Detection
    Zhou, Wujie
    Zhu, Yun
    Lei, Jingsheng
    Wan, Jian
    Yu, Lu
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2022, 6 (04): : 957 - 968
  • [33] Spatial attention-guided deformable fusion network for salient object detection
    Aiping Yang
    Yan Liu
    Simeng Cheng
    Jiale Cao
    Zhong Ji
    Yanwei Pang
    Multimedia Systems, 2023, 29 : 2563 - 2573
  • [34] Spatial attention-guided deformable fusion network for salient object detection
    Yang, Aiping
    Liu, Yan
    Cheng, Simeng
    Cao, Jiale
    Ji, Zhong
    Pang, Yanwei
    MULTIMEDIA SYSTEMS, 2023, 29 (05) : 2563 - 2573
  • [35] Polarization spatial and semantic learning lightweight network for underwater salient object detection
    Yang, Xiaowen
    Li, Qingwu
    Yu, Dabing
    Gao, Zheng
    Huo, Guanying
    JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (03)
  • [36] SALIENT OBJECT DETECTION FOR RGB-D IMAGE VIA SALIENCY EVOLUTION
    Guo, Jingfan
    Ren, Tongwei
    Bei, Jia
    2016 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO (ICME), 2016,
  • [37] Category-Aware Saliency Enhance Learning Based on CLIP for Weakly Supervised Salient Object Detection
    Zhang, Yunde
    Zhang, Zhili
    Liu, Tianshan
    Kong, Jun
    NEURAL PROCESSING LETTERS, 2024, 56 (02)
  • [38] Category-Aware Saliency Enhance Learning Based on CLIP for Weakly Supervised Salient Object Detection
    Yunde Zhang
    Zhili Zhang
    Tianshan Liu
    Jun Kong
    Neural Processing Letters, 56
  • [39] Salient object detection using color spatial distribution and minimum spanning tree weight
    Tang, Chang
    Hou, Chunping
    Wang, Pichao
    Song, Zhanjie
    MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (12) : 6963 - 6978
  • [40] Salient object detection using color spatial distribution and minimum spanning tree weight
    Chang Tang
    Chunping Hou
    Pichao Wang
    Zhanjie Song
    Multimedia Tools and Applications, 2016, 75 : 6963 - 6978