Rail image recognition technology based on deep learning

被引：0

作者：

Xu, Xinci ^{[1
]}

Shi, Xiuxia ^{[2
]}

Geng, Chenge ^{[1
]}

Chen, Xiangxian ^{[1
]}

机构：

[1] College of Biomedical Engineering & Instrument Science, Zhejiang University, Hangzhou

[2] UniTTEC Co., Ltd, Hangzhou

来源：

Journal of Railway Science and Engineering | 2024年 / 21卷 / 12期

关键词：

computer vision; deep learning; image processing; rail recognition; rail transit;

D O I：

10.19713/j.cnki.43-1423/u.T20240411

中图分类号：

学科分类号：

摘要：

In order to ensure the operation safety of the subway in urban rail transportation and avoid safety accidents caused by obstacles inside the railway tracks, it is necessary to conduct subway rail recognition. Considering that the railway tracks have slender and continuous physical characteristics, and inspired by the lane detection in deep learning, we proposed the CLRNet-L algorithm for railway track recognition, which is an improvement from the CLRNet algorithm. To solve the problem of long and thin railway tracks that are difficult to be accurately identified and localized, CLRNet-L used a feature pyramid network to extract and fuse high-level features and low-level features. Through the idea of top-down, the high-level features were first used to locate the railway tracks in a rough way, and then the shallow features fused with the high-level features were used to further refine the tracks so as to realize the identification of the railway tracks. In response to the problem of railway tracks that are difficult to distinguish from the surrounding environment due to their dark colors, attention mechanisms and multi-scale aggregators were introduced into the original CLRNet. We proposed a large kernel attention module to capture more contextual information and enhance the feature representation of railway tracks. Due to the lack of public railway track datasets for rail recognition in the field of rail transit, we used track photos collected during subway operation in Hangzhou Line 5 and Line 6 to create a dataset of track scenes. The dataset, which includes rail images of straight, curved, and turnout scenes, was used for rail recognition experiments to verify the effectiveness of the algorithm. Experimental results show that CLRNet-L achieved 88.96% MIoU and the fastest detection speed of 11.54 ms in the custom dataset, which has higher accuracy and detection speed compared with other detection algorithms. The research results provide a technical foundation for subway safety technology, especially obstacle detection, to ensure the safety of subway operations. © 2024, Central South University Press. All rights reserved.

引用

页码：5232 / 5241

页数：9

共 23 条

[11] WANG Ziguan, SHU Guohua, Research on track section identification based on traditional image processing algorithm and deep learning[J], Electrical Automation, 41, 4, (2019)
[12] PAN Xingang, SHI Jianping, LUO Ping, Et al., Spatial as deep: spatial CNN for traffic scene understanding[J], Proceedings of the AAAI Conference on Artificial Intelligence, 32, 1, (2018)
[13] NEVEN D, DE BRABANDERE B, GEORGOULIS S, Et al., Towards end-to-end lane detection: an instance segmentation approach[C], 2018 IEEE Intelligent Vehicles Symposium (IV), pp. 286-291, (2018)
[14] KO Y, LEE Y, AZAM S, Et al., Key points estimation and point instance segmentation approach for lane detection [J], IEEE Transactions on Intelligent Transportation Systems, 23, 7, pp. 8949-8958, (2022)
[15] RAN Hao, YIN Yunfei, HUANG Faliang, Et al., FLAMNet: a flexible line anchor mechanism network for lane detection, IEEE Transactions on Intelligent Transportation Systems, 24, 11, pp. 12767-12778, (2023)
[16] QIN Zequn, WANG Huanyu, LI Xi, Ultra fast structure-aware deep lane detection[C], European Conference on Computer Vision, pp. 276-291, (2020)
[17] TABELINI L, BERRIEL R, PAIXAO T M, Et al., Keep your eyes on the lane: real-time attention-guided lane detection[C], 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 294-302, (2021)
[18] ZHENG Tu, HUANG Yifei, LIU Yang, Et al., CLRNet: cross layer refinement network for lane detection[C], 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 888-897, (2022)
[19] LIN T Y, DOLLAR P, GIRSHICK R, Et al., Feature pyramid networks for object detection[C], 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 936-944, (2017)
[20] HE Kaiming, ZHANG Xiangyu, REN Shaoqing, Et al., Deep residual learning for image recognition[C], 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 770-778, (2016)

← 1 2 3 →