Improved Mask R-CNN for obstacle detection of rail transit

被引:59
作者
He, Deqiang [1 ]
Qiu, Yefeng [1 ]
Miao, Jian [1 ]
Zou, Zhiheng [1 ]
Li, Kai [1 ]
Ren, Chonghui [2 ]
Shen, Guoqiang [2 ]
机构
[1] Guangxi Univ, Sch Mech Engn, Guangxi Key Lab Mfg Syst & Adv Mfg Technol, Nanning 530004, Peoples R China
[2] Nanning Rail Transit Co Ltd, Nanning 530029, Peoples R China
关键词
Obstacle detection; Rail transit; Mask R-CNN; Deep learning; Image processing;
D O I
10.1016/j.measurement.2022.110728
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Accurate identification of obstacles shows great significance to improve the safety of automatic operation trains. The ME Mask R-CNN is proposed to improve the accuracy of active identification. The SSwin-Le Transformer is used as the feature extraction network and the ME-PAPN is used as the feature fusion network. A variety of multiscale enhancement methods are integrated to improve the detection ability of small target objects. PrIme sample attention is used as the sampling method, the anchor boxes size and ratio suitable for the characteristics of train obstacles are adopted. The train obstacle dataset is based on a variety of test scenarios such as Nanning Metro Line 1 test line, tunnel line and night test. The test results show that ME Mask R-CNN achieves 91.3 % mAP with an average detection time of 4.2 FPS, which is 11.1 % higher than that of Mask R-CNN.
引用
收藏
页数:10
相关论文
共 45 条
[1]  
Aminmansour S., 2014, PROC SPE INTELL ENER, P1, DOI 10.1109/DICTA.2014.7008119
[2]  
[Anonymous], 2015, 3 INT C LEARN REPR I
[3]  
[Anonymous], 2008, COMPUT VIS IMAGE UND, DOI DOI 10.1016/j.cviu.2007.09.014
[4]   A machine-learning fatigue life prediction approach of additively manufactured metals [J].
Bao, Hongyixi ;
Wu, Shengchuan ;
Wu, Zhengkai ;
Kang, Guozheng ;
Peng, Xin ;
Withers, Philip J. .
ENGINEERING FRACTURE MECHANICS, 2021, 242
[5]  
Bochkovskiy A., 2020, PREPRINT
[6]   Prime Sample Attention in Object Detection [J].
Cao, Yuhang ;
Chen, Kai ;
Loy, Chen Change ;
Lin, Dahua .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :11580-11588
[7]  
Chen K., 2019, CoRR abs/1906.07155
[8]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[9]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[10]  
Felzenszwalb P, 2008, PROC CVPR IEEE, P1984