A lightweight coal mine pedestrian detector for video surveillance systems with multi-level feature fusion and channel pruning

Cited by: 0
Authors
Xie, Bei Jing [1]
Li, Heng [1]
Luan, Zheng [1]
Li, Xiao Xu [1]
Lei, Zhen [2]
Affiliations
[1] China Univ Min & Technol Beijing, Sch Emergency Management & Safety Engn, Beijing 100083, Peoples R China
[2] Guizhou Inst Technol, Sch Min Engn, Guiyang 550000, Guizhou, Peoples R China
Source
SCIENTIFIC REPORTS | 2025 / Vol. 15 / Issue 01
Keywords
Coal mine pedestrian detection; Video surveillance; Lightweight architecture; Channel pruning; Accident prevention
DOI
10.1038/s41598-025-87157-7
Chinese Library Classification
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Subject classification codes
07; 0710; 09
Abstract
Pedestrian detection in coal mines is crucial for video surveillance systems. Limited computational resources make it difficult to deploy large models, which reduces detection efficiency. To address this, we propose a lightweight coal mine pedestrian detector with multi-level feature fusion. Our approach integrates coordinate attention into the backbone network and introduces a bidirectional feature pyramid network and a thin-neck technique to enhance multi-scale detection capability while reducing computational load. We also employ a regression loss with a dynamic focus mechanism for bounding box regression to minimize model errors. The Linkage Channel Pruning method enforces channel-level sparsity on the designed detector to achieve network slimming and a second stage of lightweighting. On a proprietary dataset, our method has 0.61 M parameters, a computational load of 2.0 GFLOPs, a model size of 1.48 MB, a detection accuracy of 0.966, and an inference time of 2.1 ms. Compared to the baseline, our method achieves a 4.96× reduction in parameters, a 4.05× reduction in computational load, a 4.02× reduction in model size, a 59.62% reduction in inference time, and a 1.2% accuracy improvement. Experimental validation on proprietary and public datasets confirms that our method exhibits state-of-the-art lightweight performance, accuracy, and real-time capability, demonstrating significant potential in practical engineering applications. The findings provide a technical reference for coal mine video surveillance systems and support real-time accident prevention.
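The abstract does not describe how Linkage Channel Pruning works internally, so the following is only a minimal sketch of the general idea of channel-level sparsity pruning, in the style of network slimming: each channel is assumed to carry a learned scaling factor, and channels whose factor falls below a global threshold are removed. The function names, the `prune_ratio` parameter, and the example scaling factors are all hypothetical and are not taken from the paper.

```python
# Generic channel-pruning sketch (network-slimming style).
# NOT the paper's Linkage Channel Pruning; all names and numbers are hypothetical.

def global_threshold(gammas_per_layer, prune_ratio):
    """Return the |gamma| cutoff so that roughly `prune_ratio` of all channels fall below it."""
    all_gammas = sorted(abs(g) for layer in gammas_per_layer for g in layer)
    cut = min(int(len(all_gammas) * prune_ratio), len(all_gammas) - 1)
    return all_gammas[cut]


def channel_masks(gammas_per_layer, prune_ratio):
    """Build a keep/drop mask per layer from per-channel scaling factors."""
    thr = global_threshold(gammas_per_layer, prune_ratio)
    masks = []
    for layer in gammas_per_layer:
        mask = [abs(g) >= thr for g in layer]
        if not any(mask):
            # Keep the strongest channel so that no layer is removed entirely.
            best = max(range(len(layer)), key=lambda i: abs(layer[i]))
            mask[best] = True
        masks.append(mask)
    return masks


def reduction_factor(before, after):
    """Express a size change as an 'N x reduction', as the abstract does."""
    return before / after


# Hypothetical scaling factors for two layers with four channels each.
gammas = [
    [0.90, 0.02, 0.40, 0.01],
    [0.03, 0.70, 0.05, 0.60],
]
masks = channel_masks(gammas, prune_ratio=0.5)
kept = [sum(m) for m in masks]
```

In this toy setup, half of the eight channels are kept, so `reduction_factor(8, sum(kept))` expresses the channel count as a 2× reduction. In a real detector the kept masks would then be used to rebuild slimmer convolution layers, usually followed by fine-tuning; those steps are omitted here.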
Pages: 25