A Survey of the Four Pillars for Small Object Detection: Multiscale Representation, Contextual Information, Super-Resolution, and Region Proposal

被引：213

作者：

Chen, Guang ^{[1
,2
,3
]}

Wang, Haitao ^{[1
]}

Chen, Kai ^{[1
]}

Li, Zhijun ^{[4
]}

Song, Zida ^{[1
]}

Liu, Yinlong ^{[3
]}

Chen, Wenkai ^{[5
]}

Knoll, Alois ^{[3
]}

机构：

[1] Tongji Univ, Sch Automot Studies, Shanghai 200092, Peoples R China

[2] Hunan Univ, State Key Lab Adv Design & Mfg Vehicle Body, Changsha 410082, Hunan, Peoples R China

[3] Tech Univ Munich, Dept Informat, D-80333 Munich, Germany

[4] Univ Sci & Technol China, Dept Automat, Hefei 230000, Peoples R China

[5] Shanghai Jiao Tong Univ, Sch Mech Engn, Shanghai 200240, Peoples R China

来源：

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS | 2022年 / 52卷 / 02期

基金：

中国国家自然科学基金;

关键词：

Object detection; Feature extraction; Detectors; Image resolution; Machine learning; Roads; Task analysis; Contextual information; multiscale representation; region proposal; small object dataset; small object detection; super-resolution; CONVOLUTIONAL NEURAL-NETWORK; PEDESTRIAN DETECTION; NEUROMORPHIC VISION;

D O I：

10.1109/TSMC.2020.3005231

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Although great progress has been made in generic object detection by advanced deep learning techniques, detecting small objects from images is still a difficult and challenging problem in the field of computer vision due to the limited size, less appearance, and geometry cues, and the lack of large-scale datasets of small targets. Improving the performance of small object detection has a wider significance in many real-world applications, such as self-driving cars, unmanned aerial vehicles, and robotics. In this article, the first-ever survey of recent studies in deep learning-based small object detection is presented. Our review begins with a brief introduction of the four pillars for small object detection, including multiscale representation, contextual information, super-resolution, and region-proposal. Then, the collection of state-of-the-art datasets for small object detection is listed. The performance of different methods on these datasets is reported later. Moreover, the state-of-the-art small object detection networks are investigated along with a special focus on the differences and modifications to improve the detection performance comparing to generic object detection architectures. Finally, several promising directions and tasks for future work in small object detection are provided. Researchers can track up-to-date studies on this webpage available at: https://github.com/tjtum-chenlab/SmallObjectDetectionList.

引用

页码：936 / 953

页数：18

共 131 条

[111] SUN Database: Exploring a Large Collection of Scene Categories [J].

Xiao, Jianxiong ;

Ehinger, Krista A. ;

Hays, James ;

Torralba, Antonio ;

Oliva, Aude .

INTERNATIONAL JOURNAL OF COMPUTER VISION, 2016, 119 (01) :3-22

[112]

Xiong YJ, 2015, PROC CVPR IEEE, P1600, DOI 10.1109/CVPR.2015.7298768

[113]

Xu Fengqiang, 2018, 2018 OCEANS MTSIEEE

[114]

Xu M., 2018, MDSSD MULTISCALE DEC

[115] Detection of Sudden Pedestrian Crossings for Driving Assistance Systems [J].

Xu, Yanwu ;

Xu, Dong ;

Lin, Stephen ;

Han, Tony X. ;

Cao, Xianbin ;

Li, Xuelong .

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2012, 42 (03) :729-739

[116] Scene Understanding in Deep Learning-Based End-to-End Controllers for Autonomous Vehicles [J].

Yang, Shun ;

Wang, Wenshuo ;

Liu, Chang ;

Deng, Weiwen .

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2019, 49 (01) :53-63

[117] WIDER FACE: A Face Detection Benchmark [J].

Yang, Shuo ;

Luo, Ping ;

Loy, Chen Change ;

Tanal, Xiaoou .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :5525-5533

[118] Detecting Small Objects in Urban Settings Using SlimNet Model [J].

Yang, Zheng ;

Liu, Yaolin ;

Liu, Lirong ;

Tang, Xinming ;

Xie, Junfeng ;

Gao, Xiaoming .

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2019, 57 (11) :8445-8457

[119] Automatic Weakly Supervised Object Detection From High Spatial Resolution Remote Sensing Images via Dynamic Curriculum Learning [J].

Yao, Xiwen ;

Feng, Xiaoxu ;

Han, Junwei ;

Cheng, Gong ;

Guo, Lei .

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (01) :675-685

[120] Semantic Annotation of High-Resolution Satellite Images via Weakly Supervised Learning [J].

Yao, Xiwen ;

Han, Junwei ;

Cheng, Gong ;

Qian, Xueming ;

Guo, Lei .

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2016, 54 (06) :3660-3671

← 5 6 7 8 9 10 11 12 13 14 →