Foreign Body Detection in Rail Transit Based on a Multi-Mode Feature-Enhanced Convolutional Neural Network

被引:21
作者
Ye, Tao [1 ,2 ,3 ]
Zhang, Jun [1 ,2 ,3 ]
Zhao, Zongyang [1 ,2 ,3 ]
Zhou, Fuqiang [4 ]
机构
[1] China Univ Min & Technol Beijing, Sch Mech Elect & Informat Engn, Beijing 100083, Peoples R China
[2] State Key Lab Coal Min & Clean Utilizat, Beijing 100013, Peoples R China
[3] Minist Emergency Management, Key Lab Intelligent Min & Robot, Beijing 100093, Peoples R China
[4] Beihang Univ, Sch Instrumentat & Optoelect Engn, Beijing 100191, Peoples R China
基金
中国国家自然科学基金;
关键词
Rail transportation; Feature extraction; Object detection; Real-time systems; Convolution; Training; Rails; Railway traffic safety; object detection; deep learning; multi-mode feature enhanced convolutional neural network;
D O I
10.1109/TITS.2022.3154751
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Detection of railway traffic objects is an important task during train driving and is implemented to ensure safe driving. Although object detection has been investigated for years, many challenges exist in precisely detecting railway objects under complex railway scenes. These challenges mainly include adverse weather states, various railway backgrounds, diverse railway objects, and low-quality images. To address these issues, we introduce a novel deep learning method, called a multi-mode feature enhanced convolutional neural network (MMFE-Net), for accurate railway object detection. The network mainly consists of three modules. 1) An improved cross-stage partial connection darknet53 (CSPDarknet53), called adaptive dilated cspdarknet53, is used as our backbone to reduce image information loss. 2) A spatial feature extraction module is used to improve the feature extraction ability of the model for blurred objects and objects in a complicated background. 3) We introduce an attention fusion enhance module to strengthen the context information between adjacent feature maps to accurately detect multiscale and small objects. The proposed method achieves 0.9439 mAP and 79 FPS with an input size of 640x 640 pixels on the railway traffic dataset, and its performance is better than that of YOLOv4. Moreover, it is feasible to apply MMFE-Net into practical applications of railway object detection.
引用
收藏
页码:18051 / 18063
页数:13
相关论文
共 44 条
[1]   Video Foreground Extraction Using Multi-View Receptive Field and EncoderDecoder DCNN for Traffic and Surveillance Applications [J].
Akilan, Thangarajah ;
Wu, Q. M. Jonathan ;
Zhang, Wandong .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2019, 68 (10) :9478-9493
[2]  
[Anonymous], 2015, Neurocomputing, DOI [DOI 10.1016/J.NEUCOM.2015.09.116, 10. 1016/j.neucom.2015.09.116]
[3]  
Bochkovskiy A, 2020, ARXIV, DOI 10.48550/ARXIV.2004.10934
[4]   Rapid Detection of Blind Roads and Crosswalks by Using a Lightweight Semantic Segmentation Network [J].
Cao, Zhengcai ;
Xu, Xiaowen ;
Hu, Biao ;
Zhou, MengChu .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (10) :6188-6197
[5]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[6]   ACNet: Strengthening the Kernel Skeletons for Powerful CNN via Asymmetric Convolution Blocks [J].
Ding, Xiaohan ;
Guo, Yuchen ;
Ding, Guiguang ;
Han, Jungong .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :1911-1920
[7]   Fast R-CNN [J].
Girshick, Ross .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448
[8]  
[郭保青 Guo Baoqing], 2018, [光学精密工程, Optics and Precision Engineering], V26, P3040
[9]  
He D., IEEE T INSTRUM MEAS, V70, P1
[10]  
He KM, 2014, LECT NOTES COMPUT SC, V8691, P346, DOI [arXiv:1406.4729, 10.1007/978-3-319-10578-9_23]