Long-Distance Person Detection Based on YOLOv7

被引:28
作者
Tang, Fan [1 ,2 ]
Yang, Fang [1 ,2 ]
Tian, Xianqing [1 ,2 ]
机构
[1] Hebei Univ, Sch Cyberspace Secur & Comp, Baoding 071000, Peoples R China
[2] Hebei Univ, Inst Intelligence Image & Document Informat Proc, Baoding 071000, Peoples R China
关键词
object detection; YOLOv7; recursive gated convolution; tiny object detection layer; coordinate attention mechanism;
D O I
10.3390/electronics12061502
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the research field of small object detection, most object detectors have been successfully used for pedestrian detection, face recognition, lost and found, and automatic driving, among other applications, and have achieved good results. However, when general object detectors encounter challenging low-resolution images from the TinyPerson dataset, they will produce undesirable detection results because of the dense occlusion between people and different body poses. In order to solve these problems, this paper proposes a tiny object detection method TOD-YOLOv7 based on YOLOv7.First, this paper presents a reconstruction of the YOLOv7 network by adding a tiny object detection layer to enhance its detection ability. Then, we use the recursive gated convolution module to realize the interaction with the higher-order space to accelerate the model initialization process and reduce the reasoning time. Secondly, this paper proposes the integration of a coordinate attention mechanism into the YOLOv7 feature extraction network to strengthen the pedestrian object information and weaken the background information.Additionally, we leverage data augmentation techniques to improve the representation learning of the algorithm. The results show that compared with the baseline model YOLOv7, the detection accuracy of this model on the TinyPerson dataset is improved from 7.1% to 9.5%, and the detection speed reaches 208 frames per second (FPS). The algorithm of this paper is shown to achieve better detection results for tiny object detection.
引用
收藏
页数:14
相关论文
共 38 条
[1]  
[Anonymous], 2017, ARXIV170306870
[2]  
Bochkovskiy A., 2020, YOLOv4: Optimal Speed and Accuracy of Object Detection, Vabs/2004.10934, P1
[3]   Cascade R-CNN: Delving into High Quality Object Detection [J].
Cai, Zhaowei ;
Vasconcelos, Nuno .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :6154-6162
[4]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[5]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[6]   Pedestrian Detection: An Evaluation of the State of the Art [J].
Dollar, Piotr ;
Wojek, Christian ;
Schiele, Bernt ;
Perona, Pietro .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (04) :743-761
[7]  
Dollár P, 2009, PROC CVPR IEEE, P304, DOI 10.1109/CVPRW.2009.5206631
[8]  
Dosovitskiy Alexey, 2021, P ICLR
[9]   Monocular Pedestrian Detection: Survey and Experiments [J].
Enzweiler, Markus ;
Gavrila, Dariu M. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2009, 31 (12) :2179-2195
[10]  
Harris E., 2020, ARXIV