IDO: Instance dual-optimization for weakly supervised object detection

被引:0
|
作者
Ren, Zhida [1 ,2 ]
Tang, Yongqiang [1 ]
Zhang, Wensheng [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Automat, State Key Lab Multimodel Artificial Intelligence, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep learning; Weakly supervised learning; Object detection; Multiple instance learning;
D O I
10.1007/s10489-023-04956-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Weakly supervised object detection (WSOD) has attracted significant attention in recent years, as it utilizes only image-level annotations to train object detectors and greatly reduces the labor and capital cost of fine labeling. Nevertheless, the absence of instance-level annotations leads to two phenomena: partial regions and missing instances. We believe these are mainly caused by two issues: 1) Noisy instances exist in the training samples, which can confuse the detector. 2) Global salient information is missing, resulting in little attention being received in the low-confidence region. To solve the above two problems, we propose an instance dual-optimization framework called IDO. First, an instance-wise selection strategy (IWSS) based on curriculum learning is proposed for instance denoising and for improving the robustness of the model. Second, CAM-generated spatial attention (CGSA) is carefully designed to optimize the features of instances. Without introducing additional hyperparameters, our CGSA complements the low class-confidence region with more global salient information, which assists the model in acquiring a more complete region of the target and identifying more neglected targets. Finally, we empirically demonstrate that our proposal can achieve comparable results to those of other state-of-the-art methods on PASCAL VOC 2007, PASCAL VOC 2012, and MS COCO.
引用
收藏
页码:26763 / 26780
页数:18
相关论文
共 50 条
  • [31] Weakly Supervised Object Detection with Symmetry Context
    Gu, Xinyu
    Zhang, Qian
    Lu, Zheng
    SYMMETRY-BASEL, 2022, 14 (09):
  • [32] Forget and Diversify: Regularized Refinement for Weakly Supervised Object Detection
    Son, Jeany
    Kim, Daniel
    Lee, Solae
    Kwak, Suha
    Cho, Minsu
    Han, Bohyung
    COMPUTER VISION - ACCV 2018, PT IV, 2019, 11364 : 632 - 648
  • [33] ITNet: Low-Shot Instance Transformation Network for Weakly Supervised Object Detection in Remote Sensing Images
    Liu, Peng
    Pan, Zongxu
    Lei, Bin
    Hu, Yuxin
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 13
  • [34] Weakly Supervised Object Detection for Remote Sensing Images via Progressive Image-Level and Instance-Level Feature Refinement
    Zheng, Shangdong
    Wu, Zebin
    Xu, Yang
    Wei, Zhihui
    REMOTE SENSING, 2024, 16 (07)
  • [35] Min-Entropy Latent Model for Weakly Supervised Object Detection
    Wan, Fang
    Wei, Pengxu
    Han, Zhenjun
    Jiao, Jianbin
    Ye, Qixiang
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (10) : 2395 - 2409
  • [36] Multi⁃level Fusion Based Weakly Supervised Object Detection Network
    Cao, Huan
    Chen, Zengping
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2024, 37 (05): : 424 - 434
  • [37] Online Active Proposal Set Generation for weakly supervised object detection
    Jin, Ruibing
    Lin, Guosheng
    Wen, Changyun
    KNOWLEDGE-BASED SYSTEMS, 2022, 237
  • [38] Negative Prototypes Guided Contrastive Learning for Weakly Supervised Object Detection
    Zhang, Yu
    Zhu, Chuang
    Yang, Guoqing
    Chen, Siqi
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT II, 2023, 14170 : 36 - 51
  • [39] Dynamic proposal sampling for weakly supervised object detection
    Jiang, Wenhui
    Zhao, Zhicheng
    Su, Fei
    Fang, Yuming
    NEUROCOMPUTING, 2021, 441 : 248 - 259
  • [40] Weakly Supervised Cell Instance Segmentation by Propagating from Detection Response
    Nishimura, Kazuya
    Ker, Dai Fei Elmer
    Bise, Ryoma
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2019, PT I, 2019, 11764 : 649 - 657