Spatial Continuity and Nonequal Importance in Salient Object Detection With Image-Category Supervision

被引:0
|
作者
Wu, Zhihao [1 ]
Liu, Chengliang [1 ]
Wen, Jie [1 ]
Xu, Yong [1 ,2 ]
Yang, Jian [3 ]
Li, Xuelong [4 ]
机构
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Shenzhen 518055, Peoples R China
[2] Pengcheng Lab, Shenzhen 518055, Peoples R China
[3] Nanjing Univ Sci & Technol, Dept Comp Sci & Engn, Nanjing 210094, Peoples R China
[4] Northwestern Polytech Univ, Sch Artificial Intelligence OPt & Elect iOPEN, Xian 710072, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Noise; Detectors; Transformers; Robustness; Object detection; Training; Feature extraction; Benchmark; robustness; salient object detection; weak supervision;
D O I
10.1109/TNNLS.2024.3436519
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Due to the inefficiency of pixel-level annotations, weakly supervised salient object detection with image-category labels (WSSOD) has been receiving increasing attention. Previous works usually endeavor to generate high-quality pseudolabels to train the detectors in a fully supervised manner. However, we find that the detection performance is often limited by two types of noise contained in pseudolabels: 1) holes inside the object or at the edge and outliers in the background and 2) missing object portions and redundant surrounding regions. To mitigate the adverse effects caused by them, we propose local pixel correction (LPC) and key pixel attention (KPA), respectively, based on two key properties of desirable pseudolabels: 1) spatial continuity, meaning an object region consists of a cluster of adjacent points; and 2) nonequal importance, meaning pixels have different importance for training. Specifically, LPC fills holes and filters out outliers based on summary statistics of the neighborhood as well as its size. KPA directs the focus of training toward ambiguous pixels in multiple pseudolabels to discover more accurate saliency cues. To evaluate the effectiveness of our method, we design a simple yet strong baseline we call weakly supervised saliency detector with Transformer (WSSDT) and unify the proposed modules into WSSDT. Extensive experiments on five datasets demonstrate that our method significantly improves the baseline and outperforms all existing congeneric methods. Moreover, we establish the first benchmark to evaluate WSSOD robustness. The results show that our method can improve detection robustness as well. The code and robustness benchmark are available at https://github.com/Horatio9702/SCNI.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Backtracking Spatial Pyramid Pooling-Based Image Classifier for Weakly Supervised Top-Down Salient Object Detection
    Cholakkal, Hisham
    Johnson, Jubin
    Rajan, Deepu
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (12) : 6064 - 6078
  • [22] Spatial attention guided cGAN for improved salient object detection
    Dhara, Gayathri
    Kumar, Ravi Kant
    FRONTIERS IN COMPUTER SCIENCE, 2024, 6
  • [23] Optimisation for image salient object detection based on semantic-aware clustering and CRF
    Chen, Junhao
    Niu, Yuzhe
    Wu, Jianbin
    Chen, Junrong
    IET COMPUTER VISION, 2020, 14 (02) : 49 - 58
  • [24] Salient Object Detection Based on Regional Contrast and Relative Spatial Compactness
    Xu, Dan
    Tang, Zhenmin
    Xu, Wei
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2013, 7 (11): : 2737 - 2753
  • [25] Exploring Image Enhancement for Salient Object Detection in Low Light Images
    Xu, Xin
    Wang, Shiqin
    Wang, Zheng
    Zhang, Xiaolong
    Hu, Ruimin
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2021, 17 (01)
  • [26] Deep Neural Network Based Salient Object Detection with Image Enhancement
    Zhou, Lecheng
    Gu, Xiaodong
    NEURAL INFORMATION PROCESSING (ICONIP 2018), PT IV, 2018, 11304 : 444 - 453
  • [27] Salient Object Detection with Fusion of RGB Image and Eye Tracking Data
    Wu, Yage
    Chen, Yadi
    Zhang, Rui
    HUMAN BRAIN AND ARTIFICIAL INTELLIGENCE, HBAI 2022, 2023, 1692 : 86 - 98
  • [28] Fitting-based optimisation for image visual salient object detection
    Niu, Yuzhen
    Lin, Wenqi
    Ke, Xiao
    Ke, Lingling
    IET COMPUTER VISION, 2017, 11 (02) : 161 - 172
  • [29] Spatial Attention Feedback Iteration for Lightweight Salient Object Detection in Optical Remote Sensing Images
    Luo, HuiLan
    Wang, JianQin
    Liang, BoCheng
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 13809 - 13823
  • [30] Lightweight Progressive Multilevel Feature Collaborative Network for Remote Sensing Image Salient Object Detection
    Cheng, Bei
    Liu, Zao
    Wang, Qingwang
    Shen, Tao
    Fu, Chengbiao
    Tian, Anhong
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62