An improved object detection algorithm based on multi-scaled and deformable convolutional neural networks

Cited by: 63
Authors
Cao, Danyang [1, 2]
Chen, Zhixin [1]
Gao, Lei [1]
Affiliations
[1] North China Univ Technol, Sch Informat Sci & Technol, Beijing 100144, Peoples R China
[2] Beijing Key Lab Integrat & Anal Large Scale Strea, Beijing 100144, Peoples R China
Funding
Beijing Natural Science Foundation;
Keywords
Object detection; Machine learning; AI; Deformable convolution; Computer vision; FUSION;
DOI
10.1186/s13673-020-00219-9
Chinese Library Classification
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Object detection methods aim to identify all target objects in an image and determine their categories and positions in order to achieve machine vision understanding. Numerous approaches have been proposed to solve this problem, mainly inspired by methods from computer vision and deep learning. However, existing approaches often perform poorly when detecting small, dense objects, and can even fail to detect objects under random geometric transformations. In this study, we compare and analyse mainstream object detection algorithms and propose a multi-scaled deformable convolutional object detection network to address the challenges faced by current methods. Our analysis demonstrates performance on par with, or even better than, state-of-the-art methods. We use deep convolutional networks to obtain multi-scaled features, and add deformable convolutional structures to handle geometric transformations. We then fuse the multi-scaled features by upsampling in order to perform the final object recognition and region regression. Experiments show that the proposed framework improves the accuracy of detecting small target objects with geometric deformation, with significant improvements in the trade-off between accuracy and speed.
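The fusion step mentioned in the abstract (combining multi-scaled features by upsampling) can be illustrated with a minimal sketch. The abstract does not specify the interpolation mode or how the maps are merged, so nearest-neighbour upsampling and channel-wise concatenation are assumptions here, and the function names `upsample_nearest` and `fuse` are hypothetical, not taken from the paper.

```python
import numpy as np


def upsample_nearest(feat, factor=2):
    """Nearest-neighbour upsampling of a (C, H, W) feature map."""
    # Repeat every row and every column `factor` times.
    return feat.repeat(factor, axis=1).repeat(factor, axis=2)


def fuse(coarse, fine):
    """Upsample `coarse` to the resolution of `fine`, then concatenate on channels.

    Assumption: the two maps differ in spatial size by an integer factor.
    """
    factor = fine.shape[1] // coarse.shape[1]
    up = upsample_nearest(coarse, factor)
    if up.shape[1:] != fine.shape[1:]:
        raise ValueError("spatial sizes must differ by an integer factor")
    return np.concatenate([up, fine], axis=0)


# Toy example: a 1-channel 2x2 coarse map fused with a 2-channel 4x4 fine map.
coarse = np.arange(4, dtype=np.float32).reshape(1, 2, 2)
fine = np.zeros((2, 4, 4), dtype=np.float32)
fused = fuse(coarse, fine)
```

In this toy case `fused` has three channels at the finer 4x4 resolution; a detection head would then operate on the fused map.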
Pages: 22