Focus-and-Detect: A small object detection framework for aerial images

被引:65
作者
Koyun, Onur Can [1 ]
Keser, Reyhan Kevser [1 ]
Akkaya, Ibrahim Batuhan [2 ]
Toreyin, Behcet Ugur [1 ]
机构
[1] Istanbul Tech Univ, Informat Inst, TR-34469 Istanbul, Turkey
[2] Aselsan, TR-06200 Ankara, Turkey
关键词
Object detection; Small object detection; Region search; Aerial images;
D O I
10.1016/j.image.2022.116675
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Despite recent advances, object detection in aerial images is still a challenging task. Specific problems in aerial images makes the detection problem harder, such as small objects, densely packed objects, objects in different sizes and with different orientations. To address small object detection problem, we propose a two-stage object detection framework called "Focus-and-Detect". The first stage which consists of an object detector network supervised by a Gaussian Mixture Model, generates clusters of objects constituting the focused regions. The second stage, which is also an object detector network, predicts objects within the focal regions. Incomplete Box Suppression (IBS) method is also proposed to overcome the truncation effect of region search approach. Results indicate that the proposed two-stage framework achieves an AP score of 42.06 on VisDrone validation dataset, surpassing all other state-of-the-art small object detection methods reported in the literature, to the best of authors' knowledge.
引用
收藏
页数:9
相关论文
共 63 条
[1]  
[Anonymous], 2017, INT C COMPUT VIS ICC, DOI DOI 10.1109/ICCV.2017.322
[2]   Soft-NMS - Improving Object Detection With One Line of Code [J].
Bodla, Navaneeth ;
Singh, Bharat ;
Chellappa, Rama ;
Davis, Larry S. .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :5562-5570
[3]  
Butte S., 2021, ARXIV PREPRINT ARXIV
[4]   End-to-End Object Detection with Transformers [J].
Carion, Nicolas ;
Massa, Francisco ;
Synnaeve, Gabriel ;
Usunier, Nicolas ;
Kirillov, Alexander ;
Zagoruyko, Sergey .
COMPUTER VISION - ECCV 2020, PT I, 2020, 12346 :213-229
[5]   RRNet: A Hybrid Detector for Object Detection in Drone-captured Images [J].
Chen, Changrui ;
Zhang, Yu ;
Lv, Qingxuan ;
Wei, Shuo ;
Wang, Xiaorui ;
Sun, Xin ;
Dong, Junyu .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, :100-108
[6]  
Chen K., 2019, arXiv:1906.07155
[7]   RIFD-CNN: Rotation-Invariant and Fisher Discriminative Convolutional Neural Networks for Object Detection [J].
Cheng, Gong ;
Zhou, Peicheng ;
Han, Junwei .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :2884-2893
[8]   Deformable Convolutional Networks [J].
Dai, Jifeng ;
Qi, Haozhi ;
Xiong, Yuwen ;
Li, Yi ;
Zhang, Guodong ;
Hu, Han ;
Wei, Yichen .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :764-773
[9]  
Dai X., 2021, CVPR, P7373
[10]   HybridNet: A fast vehicle detection system for autonomous driving [J].
Dai, Xuerui .
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2019, 70 :79-88