Long-Distance Person Detection Based on YOLOv7

被引:28
作者
Tang, Fan [1 ,2 ]
Yang, Fang [1 ,2 ]
Tian, Xianqing [1 ,2 ]
机构
[1] Hebei Univ, Sch Cyberspace Secur & Comp, Baoding 071000, Peoples R China
[2] Hebei Univ, Inst Intelligence Image & Document Informat Proc, Baoding 071000, Peoples R China
关键词
object detection; YOLOv7; recursive gated convolution; tiny object detection layer; coordinate attention mechanism;
D O I
10.3390/electronics12061502
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the research field of small object detection, most object detectors have been successfully used for pedestrian detection, face recognition, lost and found, and automatic driving, among other applications, and have achieved good results. However, when general object detectors encounter challenging low-resolution images from the TinyPerson dataset, they will produce undesirable detection results because of the dense occlusion between people and different body poses. In order to solve these problems, this paper proposes a tiny object detection method TOD-YOLOv7 based on YOLOv7.First, this paper presents a reconstruction of the YOLOv7 network by adding a tiny object detection layer to enhance its detection ability. Then, we use the recursive gated convolution module to realize the interaction with the higher-order space to accelerate the model initialization process and reduce the reasoning time. Secondly, this paper proposes the integration of a coordinate attention mechanism into the YOLOv7 feature extraction network to strengthen the pedestrian object information and weaken the background information.Additionally, we leverage data augmentation techniques to improve the representation learning of the algorithm. The results show that compared with the baseline model YOLOv7, the detection accuracy of this model on the TinyPerson dataset is improved from 7.1% to 9.5%, and the detection speed reaches 208 frames per second (FPS). The algorithm of this paper is shown to achieve better detection results for tiny object detection.
引用
收藏
页数:14
相关论文
共 38 条
[21]  
Rao YM, 2022, Arxiv, DOI arXiv:2207.14284
[22]  
Redmon J, 2018, Arxiv, DOI arXiv:1804.02767
[23]   YOLO9000: Better, Faster, Stronger [J].
Redmon, Joseph ;
Farhadi, Ali .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6517-6525
[24]   You Only Look Once: Unified, Real-Time Object Detection [J].
Redmon, Joseph ;
Divvala, Santosh ;
Girshick, Ross ;
Farhadi, Ali .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :779-788
[25]   Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks [J].
Ren, Shaoqing ;
He, Kaiming ;
Girshick, Ross ;
Sun, Jian .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (06) :1137-1149
[26]  
Vaswani A, 2017, ADV NEUR IN, V30
[27]  
Wang C.Y., 2022, arXiv
[28]   High-Order Information Matters: Learning Relation and Topology for Occluded Person Re-Identification [J].
Wang, Guan'an ;
Yang, Shuo ;
Liu, Huanyu ;
Wang, Zhicheng ;
Yang, Yang ;
Wang, Shuliang ;
Yu, Gang ;
Zhou, Erjin ;
Sun, Jian .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :6448-6457
[29]  
Wang J., 2021, arXiv
[30]   Repulsion Loss: Detecting Pedestrians in a Crowd [J].
Wang, Xinlong ;
Xiao, Tete ;
Jiang, Yuning ;
Shao, Shuai ;
Sun, Jian ;
Shen, Chunhua .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :7774-7783