Deep Learning for Generic Object Detection: A Survey

被引:1832
作者
Liu, Li [1 ,2 ]
Ouyang, Wanli [3 ]
Wang, Xiaogang [4 ]
Fieguth, Paul [5 ]
Chen, Jie [2 ]
Liu, Xinwang [1 ]
Pietikainen, Matti [2 ]
机构
[1] Natl Univ Def Technol, Changsha, Peoples R China
[2] Univ Oulu, Oulu, Finland
[3] Univ Sydney, Camperdown, NSW, Australia
[4] Chinese Univ Hong Kong, Sha Tin, Hong Kong, Peoples R China
[5] Univ Waterloo, Waterloo, ON, Canada
基金
中国国家自然科学基金;
关键词
Object detection; Deep learning; Convolutional neural networks; Object recognition; CONVOLUTIONAL NETWORKS; RECOGNITION; CLASSIFICATION; REPRESENTATION; SEGMENTATION; LOCALIZATION; ANNOTATION; GRADIENTS; CONTEXT; IMAGES;
D O I
10.1007/s11263-019-01247-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Object detection, one of the most fundamental and challenging problems in computer vision, seeks to locate object instances from a large number of predefined categories in natural images. Deep learning techniques have emerged as a powerful strategy for learning feature representations directly from data and have led to remarkable breakthroughs in the field of generic object detection. Given this period of rapid evolution, the goal of this paper is to provide a comprehensive survey of the recent achievements in this field brought about by deep learning techniques. More than 300 research contributions are included in this survey, covering many aspects of generic object detection: detection frameworks, object feature representation, object proposal generation, context modeling, training strategies, and evaluation metrics. We finish the survey by identifying promising directions for future research.
引用
收藏
页码:261 / 318
页数:58
相关论文
共 321 条
[1]  
Agrawal P, 2014, LECT NOTES COMPUT SC, V8695, P329, DOI 10.1007/978-3-319-10584-0_22
[2]   Measuring the Objectness of Image Windows [J].
Alexe, Bogdan ;
Deselaers, Thomas ;
Ferrari, Vittorio .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (11) :2189-2202
[3]  
Alexe B, 2010, PROC CVPR IEEE, P73, DOI 10.1109/CVPR.2010.5540226
[4]  
Alvarez Jose M., 2016, ADV NEURAL INFORM PR, V29, P2262
[5]   50 Years of object recognition: Directions forward [J].
Andreopoulos, Alexander ;
Tsotsos, John K. .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2013, 117 (08) :827-891
[6]  
[Anonymous], 2014, CORR
[7]  
[Anonymous], 1521 MIT AI
[8]  
[Anonymous], 2010, Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, DOI DOI 10.1109/CVPR.2010.5539906
[9]  
[Anonymous], IJCV
[10]  
[Anonymous], 2015, P IEEE C COMP VIS PA, DOI [10.1109/CVPR.2015.7298801, DOI 10.1109/CVPR.2015.7298801]