Rep-YOLO: an efficient detection method for mine personnel

被引：10

作者：

Shao, Xiaoqiang ^{[1
,2
]}

Liu, Shibo ^{[1
,2
]}

Li, Xin ^{[1
,2
]}

Lyu, Zhiyue ^{[1
,2
]}

Li, Hao ^{[1
,2
]}

机构：

[1] Xian Univ Sci & Technol, Coll Elect & Control Engn, Xian 710054, Peoples R China

[2] Xian Key Lab Elect Equipment Condit Monitoring & P, Xian 710054, Peoples R China

来源：

JOURNAL OF REAL-TIME IMAGE PROCESSING | 2024年 / 21卷 / 02期

基金：

中国国家自然科学基金;

关键词：

Mine personnel detection; Rep-YOLO; CVCA; DER backbone; Slim-neck;

D O I：

10.1007/s11554-023-01407-3

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The detection of underground personnel is one of the key technologies in computer vision. However, this detection technique is susceptible to complex environments, resulting in low accuracy and slow speed. To accurately detect underground coal mine operators in complex environments, we combine the underground image features with K-means++ clustering anchor frames and propose a new Re-parameterization YOLO (Rep-YOLO) detection algorithm. First, the Criss-Cross-Vertical with Channel Attention (CVCA) mechanism is introduced at the end of the network to capture the Long-Range Dependencies (LRDs) in the image. This mechanism also emphasizes the significance of different channels to enhance image processing performance and improve the representation ability of the model. Second, the new Deep Extraction of Re-parameterization (DER) backbone network is designed, which adopts the re-parameterization structure to reduce the number of parameters and computation of the model. Additionally, each DER-block fuses different scales of features to enhance the accuracy of the model's detection capabilities. Finally, Rep-YOLO is optimized using a slim-neck structure, which reduces the complexity of the Rep-YOLO while maintaining sufficient accuracy. The results showed that the Rep-YOLO model proposed in this paper achieved an accuracy of 87.5%\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$87.5\%$$\end{document}, a recall rate of 77.2%\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$77.2\%$$\end{document}, an Average Precision (AP) of 83.1%\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$83.1\%$$\end{document}, and a Frame Per Second (FPS) of 71.9. Compared to eight different models, the recall, AP50, and FPS of the Rep-YOLO model were improved. The research shows that the Rep-YOLO model can provide a real-time and efficient method for mine personnel detection. Source code is released in https://github.com/DrLSB/Rep-YOLO.

引用

页数：16

共 50 条

[1] [Anonymous], 2006, 2006 IEEE COMP SOC C
[2] Bochkovskiy A, 2020, Arxiv, DOI [arXiv:2004.10934, DOI 10.48550/ARXIV.2004.10934]
[3] Cascade R-CNN: Delving into High Quality Object Detection
Cai, Zhaowei
Vasconcelos, Nuno
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 6154 - 6162
[4] Histograms of oriented gradients for human detection
Dalal, N
Triggs, B
[J]. 2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, : 886 - 893
[5] RepVGG: Making VGG-style ConvNets Great Again
Ding, Xiaohan
Zhang, Xiangyu
Ma, Ningning
Han, Jungong
Ding, Guiguang
Sun, Jian
[J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 13728 - 13737
[6] Sigmoid-weighted linear units for neural network function approximation in reinforcement learning
Elfwing, Stefan
Uchibe, Eiji
Doya, Kenji
[J]. NEURAL NETWORKS, 2018, 107 : 3 - 11
[7] Howard AG, 2017, Arxiv, DOI [arXiv:1704.04861, DOI 10.48550/ARXIV.1704.04861]
[8] Ge Xiaosan, 2018, Smart mine construction based on knowledge engineering and internet of things, V14, P1060
[9] Ge Z, 2021, Arxiv, DOI [arXiv:2107.08430, 10.48550/arXiv.2107.08430]
[10] Rich feature hierarchies for accurate object detection and semantic segmentation
Girshick, Ross
Donahue, Jeff
Darrell, Trevor
Malik, Jitendra
[J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 580 - 587

← 1 2 3 4 5 →