Deep Learning for Weakly-Supervised Object Detection and Localization: A Survey

被引:46
作者
Shao, Feifei [1 ]
Chen, Long [2 ]
Shao, Jian [1 ]
Ji, Wei [3 ]
Xiao, Shaoning [1 ]
Ye, Lu [4 ]
Zhuang, Yueting [1 ]
Xiao, Jun [1 ]
机构
[1] Zhejiang Univ, Coll Comp Sci, Hangzhou 310027, Peoples R China
[2] Columbia Univ, Dept Elect Engn, New York, NY 10027 USA
[3] Natl Univ Singapore, Sch Comp, Singapore 117417, Singapore
[4] Zhejiang Univ Sci & Technol, Sch Informat & Elect Engn, Hangzhou 310023, Peoples R China
基金
中国国家自然科学基金;
关键词
Weakly-supervised learning; Object detection and localization; Basic framework; Techniques; Future directions;
D O I
10.1016/j.neucom.2022.01.095
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Weakly-Supervised Object Detection (WSOD) and Localization (WSOL), i.e., detecting multiple and single instances with bounding boxes in an image using image-level labels, are long-standing and challenging tasks in object detection. Hundreds of WSOD and WSOL methods and numerous techniques have been proposed in the deep learning era. To this end, in this paper, we consider WSOL as a sub-task of WSOD and provide a comprehensive survey of the recent achievements of WSOD. Specifically, we firstly describe the formulation and setting of the WSOD, including the background, challenges, basic framework. Meanwhile, we summarize and analyze all advanced techniques and training and test tricks for improving detection performance. Then, we introduce the widely-used datasets and evaluation metrics of WSOD. Lastly, we discuss the future directions of WSOD. We believe that these summaries can help pave a way for future research on WSOD and WSOL.(c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:192 / 207
页数:16
相关论文
共 133 条
[1]   Dissimilarity Coefficient based Weakly Supervised Object Detection [J].
Arun, Aditya ;
Jawahar, C., V ;
Kumar, M. Pawan .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :9424-9433
[2]   Where to Look?: Mining Complementary Image Regions for Weakly Supervised Object Localization [J].
Babar, Sadbhavana ;
Das, Sukhendu .
2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, :1009-1018
[3]  
Bae W., 2020, EUROPEAN C COMPUTER
[4]  
Benassou S.N., NEUROCOMPUTING
[5]   Weakly Supervised Deep Detection Networks [J].
Bilen, Hakan ;
Vedaldi, Andrea .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :2846-2854
[6]  
Cao M., 2021, 2021 C EMPIRICAL MET
[7]  
Cao T., 2021, P IEEECVF INT C COMP
[8]   A Hierarchical Graph Network for 3D Object Detection on Point Clouds [J].
Chen, Jintai ;
Lei, Biwen ;
Song, Qingyu ;
Ying, Haochao ;
Chen, Danny Z. ;
Wu, Jian .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :389-398
[9]   Human-like Controllable Image Captioning with Verb-specific Semantic Roles [J].
Chen, Long ;
Jiang, Zhihong ;
Xiao, Jun ;
Liu, Wei .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :16841-16851
[10]  
Chen L, 2021, AAAI CONF ARTIF INTE, V35, P1036