Progressive Frame-Proposal Mining for Weakly Supervised Video Object Detection

Cited by: 4
Authors
Han, Mingfei [1]
Wang, Yali [2, 3]
Li, Mingjie [4]
Chang, Xiaojun [1]
Yang, Yi [5]
Qiao, Yu [2, 3]
Affiliations
[1] Univ Technol Sydney, Australian Artificial Intelligence Inst, Fac Engn & Informat Technol, ReLER Lab, Ultimo, NSW 2007, Australia
[2] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen 518055, Peoples R China
[3] Shanghai Artificial Intelligence Lab, Shanghai 202150, Peoples R China
[4] Stanford Univ, Dept Radiat Oncol, Stanford, CA 94305 USA
[5] Zhejiang Univ, Sch Comp Sci, Hangzhou 310000, Peoples R China
Funding
Australian Research Council
Keywords
Proposals; Object detection; Detectors; Annotations; Task analysis; Training; Benchmark testing; Video object detection; weakly supervised learning; holistic-view refinement;
DOI
10.1109/TIP.2024.3364536
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we focus on the weakly supervised video object detection problem, where each training video is tagged only with object labels, without any bounding-box annotations. To train object detectors effectively from such weakly annotated videos, we propose a Progressive Frame-Proposal Mining (PFPM) framework that exploits discriminative proposals in a coarse-to-fine manner. First, we design a flexible Multi-Level Selection (MLS) scheme with explicit guidance from video tags. By selecting object-relevant frames and mining important proposals from these frames, MLS reduces frame redundancy and improves proposal effectiveness, which boosts weakly supervised detectors. Moreover, we develop a novel Holistic-View Refinement (HVR) scheme, which globally evaluates the importance of proposals across frames and thus refines the pseudo ground-truth boxes used to train video detectors in a self-supervised manner. Finally, we evaluate the proposed PFPM on ImageNet VID, a large-scale benchmark for video object detection, under the weak-annotation setting. The experimental results demonstrate that PFPM significantly outperforms state-of-the-art weakly supervised detectors.
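The coarse-to-fine idea in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the abstract does not specify how frames or proposals are scored, so the function names (`select_frames`, `mine_proposals`, `holistic_refine`), the top-k rules, the `ratio` threshold, and the data layout (`scores[f][p]` as the tagged-class score of proposal `p` in frame `f`) are all assumptions made for illustration.

```python
from typing import List, Tuple

# Hypothetical layout: scores[f][p] is the score of proposal p in frame f
# for the class named by the video tag. Scores are assumed non-negative.
Scores = List[List[float]]


def select_frames(scores: Scores, num_frames: int) -> List[int]:
    """Coarse step (assumed rule): keep frames whose best proposal scores highest."""
    frame_scores = [
        (max(props) if props else float("-inf"), f)
        for f, props in enumerate(scores)
    ]
    ranked = sorted(frame_scores, key=lambda t: (-t[0], t[1]))
    return sorted(f for _, f in ranked[:num_frames])


def mine_proposals(
    scores: Scores, frames: List[int], per_frame: int
) -> List[Tuple[int, int, float]]:
    """Fine step (assumed rule): keep the top-scoring proposals in each kept frame."""
    kept = []
    for f in frames:
        order = sorted(range(len(scores[f])), key=lambda p: (-scores[f][p], p))
        kept.extend((f, p, scores[f][p]) for p in order[:per_frame])
    return kept


def holistic_refine(
    kept: List[Tuple[int, int, float]], ratio: float = 0.8
) -> List[Tuple[int, int]]:
    """Global step (assumed rule): compare proposals across all kept frames and
    keep those scoring within `ratio` of the global best as pseudo ground truth."""
    if not kept:
        return []
    best = max(s for _, _, s in kept)
    return [(f, p) for f, p, s in kept if s >= ratio * best]


# Toy video with four frames; the last frame has no proposals.
toy_scores = [[0.1, 0.2], [0.9, 0.3, 0.85], [0.5, 0.75], []]
frames = select_frames(toy_scores, num_frames=2)
kept = mine_proposals(toy_scores, frames, per_frame=2)
pseudo_gt = holistic_refine(kept, ratio=0.8)
```

In this toy run the per-frame step alone would keep a weak proposal from frame 2, while the cross-frame comparison drops it. That is the kind of effect a holistic view is meant to capture, although the actual HVR scoring in the paper may differ substantially.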
Pages: 1560-1573
Page count: 14