Progressive Frame-Proposal Mining for Weakly Supervised Video Object Detection

被引：4

作者：

Han, Mingfei ^{[1
]}

Wang, Yali ^{[2
,3
]}

Li, Mingjie ^{[4
]}

Chang, Xiaojun ^{[1
]}

Yang, Yi ^{[5
]}

Qiao, Yu ^{[2
,3
]}

机构：

[1] Univ Technol Sydney, Australian Artificial Intelligence Inst, Fac Engn & Informat Technol, ReLER Lab, Ultimo, NSW 2007, Australia

[2] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen 518055, Peoples R China

[3] Shanghai Artificial Intelligence Lab, Shanghai 202150, Peoples R China

[4] Stanford Univ, Dept Radiat Oncol, Stanford, CA 94305 USA

[5] Zhejiang Univ, Sch Comp Sci, Hangzhou 310000, Peoples R China

来源：

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2024年 / 33卷

基金：

澳大利亚研究理事会;

关键词：

Proposals; Object detection; Detectors; Annotations; Task analysis; Training; Benchmark testing; Video object detection; weakly supervised learning; holistic-view refinement;

D O I：

10.1109/TIP.2024.3364536

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we focus on the weakly supervised video object detection problem, where each training video is only tagged with object labels, without any bounding box annotations of objects. To effectively train object detectors from such weakly-annotated videos, we propose a Progressive Frame-Proposal Mining (PFPM) framework by exploiting discriminative proposals in a coarse-to-fine manner. First, we design a flexible Multi-Level Selection (MLS) scheme, with explicit guidance of video tags. By selecting object-relevant frames and mining important proposals from these frames, the proposed MLS can effectively reduce frame redundancy as well as improve proposal effectiveness to boost weakly-supervised detectors. Moreover, we develop a novel Holistic-View Refinement (HVR) scheme, which can globally evaluate importance of proposals among frames, and thus correctly refine pseudo ground truth boxes for training video detectors in a self-supervised manner. Finally, we evaluate the proposed PFPM on a large-scale benchmark for video object detection, on ImageNet VID, under the setting of weak annotations. The experimental results demonstrate that our PFPM significantly outperforms the state-of-the-art weakly-supervised detectors.

引用

页码：1560 / 1573

页数：14

共 50 条

[1] PCL: Proposal Cluster Learning for Weakly Supervised Object Detection
Tang, Peng
Wang, Xinggang
Bai, Song
Shen, Wei
Bai, Xiang
Liu, Wenyu
Yuille, Alan
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (01) : 176 - 191
[2] Learning an Invariant and Equivariant Network for Weakly Supervised Object Detection
Feng, Xiaoxu
Yao, Xiwen
Shen, Hui
Cheng, Gong
Xiao, Bin
Han, Junwei
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (10) : 11977 - 11992
[3] Weakly Supervised Object Localization and Detection: A Survey
Zhang, Dingwen
Han, Junwei
Cheng, Gong
Yang, Ming-Hsuan
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (09) : 5866 - 5885
[4] Contrastive Proposal Extension With LSTM Network for Weakly Supervised Object Detection
Lv, Pei
Hu, Suqi
Hao, Tianran
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 6879 - 6892
[5] Self-Guided Proposal Generation for Weakly Supervised Object Detection
Cheng, Gong
Xie, Xuan
Chen, Weining
Feng, Xiaoxu
Yao, Xiwen
Han, Junwei
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[6] Efficient Weakly-Supervised Object Detection With Pseudo Annotations
Yuan, Qingsheng
Sun, Gang
Liang, Jianming
Leng, Biao
IEEE ACCESS, 2021, 9 : 104356 - 104366
[7] Diverse Complementary Part Mining for Weakly Supervised Object Localization
Meng, Meng
Zhang, Tianzhu
Yang, Wenfei
Zhao, Jian
Zhang, Yongdong
Wu, Feng
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 1774 - 1788
[8] Weakly Supervised Region Proposal Network and Object Detection
Tang, Peng
Wang, Xinggang
Wang, Angtian
Yan, Yongluan
Liu, Wenyu
Huang, Junzhou
Yuille, Alan
COMPUTER VISION - ECCV 2018, PT XI, 2018, 11215 : 370 - 386
[9] Weakly-Supervised Salient Object Detection With Saliency Bounding Boxes
Liu, Yuxuan
Wang, Pengjie
Cao, Ying
Liang, Zijian
Lau, Rynson W. H.
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 4423 - 4435
[10] Progressive Representation Adaptation for Weakly Supervised Object Localization
Li, Dong
Huang, Jia-Bin
Li, Yali
Wang, Shengjin
Yang, Ming-Hsuan
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (06) : 1424 - 1438

← 1 2 3 4 5 →