Efficient railway track region segmentation algorithm based on lightweight neural network and cross-fusion decoder

被引:23
作者
Chen, Zhichao [1 ,3 ]
Yang, Jie [1 ,2 ,3 ]
Chen, Lifang [4 ]
Feng, Zhicheng [1 ,3 ]
Jia, Limin [5 ]
机构
[1] Jiangxi Univ Sci & Technol, Dept Elect Engn & Automat, Ganzhou 341000, Jiangxi, Peoples R China
[2] Chinese Acad Sci, Ganjiang Innovat Acad, Ganzhou 341000, Jiangxi, Peoples R China
[3] Jiangxi Prov Key Lab Maglev Technol, Ganzhou 341000, Jiangxi, Peoples R China
[4] Jiangxi Univ Sci & Technol, Dept Sci, Ganzhou 341000, Jiangxi, Peoples R China
[5] Beijing Jiaotong Univ, State Key Lab Railway Traff Control & Safety, Beijing 100044, Peoples R China
基金
中国国家自然科学基金;
关键词
Railway safety; Semantic segmentation; Real-time; Railway track region segmentation; Cross-fusion decoder;
D O I
10.1016/j.autcon.2023.105069
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
To segment railway track regions in real-time for intrusion detection and improving security, this paper proposes an efficient railway track region segmentation network (ERTNet) based on the encoder-decoder architecture. Firstly, to ensure the lightweight of the encoder, depthwise convolution and the channel shuffle are utilized to construct sandglass-type feature extraction unit. Secondly, a feature-matching-based cross-fusion decoder is utilized to fuse deep and shallow feature maps. Thirdly, the knowledge distillation is employed with large-scale Deeplab v3+ as the teacher model to improve performance. Additionally, a loss function is proposed to penalize pixel points with large offsets. Finally, the ERTNet is validated on the self-built dataset, achieving an MIoU (Mean Intersection over Union) of 92.4% , which is 5.22% improvement over the benchmark model. ERTNet achieves a balance between segmentation accuracy and computational efficiency, requiring only 0.5 M parameters and 0.92 G FLOPs (Floating Point Operations).
引用
收藏
页数:13
相关论文
共 50 条
[21]   Efficient attention-based deep encoder and decoder for automatic crack segmentation [J].
Kang, Dong H. ;
Cha, Young-Jin .
STRUCTURAL HEALTH MONITORING-AN INTERNATIONAL JOURNAL, 2022, 21 (05) :2190-2205
[22]   Intelligent Deployment Solution for Tabling Adapting Deep Learning [J].
Keshun, You ;
Huizhong, Liu .
IEEE ACCESS, 2023, 11 :22201-22208
[23]  
Le Saux B, 2018, INT GEOSCI REMOTE SE, P4819, DOI 10.1109/IGARSS.2018.8517865
[24]  
Liu ZH, 2021, ADV NEUR IN, V34
[25]  
Long J, 2015, PROC CVPR IEEE, P3431, DOI 10.1109/CVPR.2015.7298965
[26]   Multiscale edge detection based on Gaussian smoothing and edge tracking [J].
Lopez-Molina, C. ;
De Baets, B. ;
Bustince, H. ;
Sanz, J. ;
Barrenechea, E. .
KNOWLEDGE-BASED SYSTEMS, 2013, 44 :101-111
[27]   ShuffleNet V2: Practical Guidelines for Efficient CNN Architecture Design [J].
Ma, Ningning ;
Zhang, Xiangyu ;
Zheng, Hai-Tao ;
Sun, Jian .
COMPUTER VISION - ECCV 2018, PT XIV, 2018, 11218 :122-138
[28]   Deep learning-based active noise control on construction sites [J].
Mostafavi, Alireza ;
Cha, Young-Jin .
AUTOMATION IN CONSTRUCTION, 2023, 151
[29]   Vision Transformers for Dense Prediction [J].
Ranftl, Rene ;
Bochkovskiy, Alexey ;
Koltun, Vladlen .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :12159-12168
[30]  
Reddy AS, 2018, PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON COMMUNICATION AND ELECTRONICS SYSTEMS (ICCES 2018), P229, DOI 10.1109/CESYS.2018.8723981