Efficient railway track region segmentation algorithm based on lightweight neural network and cross-fusion decoder

被引:32
作者
Chen, Zhichao [1 ,3 ]
Yang, Jie [1 ,2 ,3 ]
Chen, Lifang [4 ]
Feng, Zhicheng [1 ,3 ]
Jia, Limin [5 ]
机构
[1] Jiangxi Univ Sci & Technol, Dept Elect Engn & Automat, Ganzhou 341000, Jiangxi, Peoples R China
[2] Chinese Acad Sci, Ganjiang Innovat Acad, Ganzhou 341000, Jiangxi, Peoples R China
[3] Jiangxi Prov Key Lab Maglev Technol, Ganzhou 341000, Jiangxi, Peoples R China
[4] Jiangxi Univ Sci & Technol, Dept Sci, Ganzhou 341000, Jiangxi, Peoples R China
[5] Beijing Jiaotong Univ, State Key Lab Railway Traff Control & Safety, Beijing 100044, Peoples R China
基金
中国国家自然科学基金;
关键词
Railway safety; Semantic segmentation; Real-time; Railway track region segmentation; Cross-fusion decoder;
D O I
10.1016/j.autcon.2023.105069
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
To segment railway track regions in real-time for intrusion detection and improving security, this paper proposes an efficient railway track region segmentation network (ERTNet) based on the encoder-decoder architecture. Firstly, to ensure the lightweight of the encoder, depthwise convolution and the channel shuffle are utilized to construct sandglass-type feature extraction unit. Secondly, a feature-matching-based cross-fusion decoder is utilized to fuse deep and shallow feature maps. Thirdly, the knowledge distillation is employed with large-scale Deeplab v3+ as the teacher model to improve performance. Additionally, a loss function is proposed to penalize pixel points with large offsets. Finally, the ERTNet is validated on the self-built dataset, achieving an MIoU (Mean Intersection over Union) of 92.4% , which is 5.22% improvement over the benchmark model. ERTNet achieves a balance between segmentation accuracy and computational efficiency, requiring only 0.5 M parameters and 0.92 G FLOPs (Floating Point Operations).
引用
收藏
页数:13
相关论文
共 50 条
[41]  
Xie EZ, 2021, ADV NEUR IN, V34
[42]   Cross-Image Relational Knowledge Distillation for Semantic Segmentation [J].
Yang, Chuanguang ;
Zhou, Helong ;
An, Zhulin ;
Jiang, Xue ;
Xu, Yongjun ;
Zhang, Qian .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, :12309-12318
[43]   Foreign Body Detection in Rail Transit Based on a Multi-Mode Feature-Enhanced Convolutional Neural Network [J].
Ye, Tao ;
Zhang, Jun ;
Zhao, Zongyang ;
Zhou, Fuqiang .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (10) :18051-18063
[44]   Research on intelligent implementation of the beneficiation process of shaking table [J].
You, Keshun ;
Wen, Chengyu ;
Liu, Huizhong .
MINERALS ENGINEERING, 2023, 199
[45]   BiSeNet V2: Bilateral Network with Guided Aggregation for Real-Time Semantic Segmentation [J].
Yu, Changqian ;
Gao, Changxin ;
Wang, Jingbo ;
Yu, Gang ;
Shen, Chunhua ;
Sang, Nong .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 129 (11) :3051-3068
[46]   BiSeNet: Bilateral Segmentation Network for Real-Time Semantic Segmentation [J].
Yu, Changqian ;
Wang, Jingbo ;
Peng, Chao ;
Gao, Changxin ;
Yu, Gang ;
Sang, Nong .
COMPUTER VISION - ECCV 2018, PT XIII, 2018, 11217 :334-349
[47]  
Yu F, 2022, AAAI CONF ARTIF INTE, P3143
[48]   Pyramid Scene Parsing Network [J].
Zhao, Hengshuang ;
Shi, Jianping ;
Qi, Xiaojuan ;
Wang, Xiaogang ;
Jia, Jiaya .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6230-6239
[49]   Rethinking Bottleneck Structure for Efficient Mobile Network Design [J].
Zhou, Daquan ;
Hou, Qibin ;
Chen, Yunpeng ;
Feng, Jiashi ;
Yan, Shuicheng .
COMPUTER VISION - ECCV 2020, PT III, 2020, 12348 :680-697
[50]   Crack segmentation through deep convolutional neural networks and heterogeneous image fusion [J].
Zhou, Shanglian ;
Song, Wei .
AUTOMATION IN CONSTRUCTION, 2021, 125 (125)