Efficient railway track region segmentation algorithm based on lightweight neural network and cross-fusion decoder

被引：32

作者：

Chen, Zhichao ^{[1
,3
]}

Yang, Jie ^{[1
,2
,3
]}

Chen, Lifang ^{[4
]}

Feng, Zhicheng ^{[1
,3
]}

Jia, Limin ^{[5
]}

机构：

[1] Jiangxi Univ Sci & Technol, Dept Elect Engn & Automat, Ganzhou 341000, Jiangxi, Peoples R China

[2] Chinese Acad Sci, Ganjiang Innovat Acad, Ganzhou 341000, Jiangxi, Peoples R China

[3] Jiangxi Prov Key Lab Maglev Technol, Ganzhou 341000, Jiangxi, Peoples R China

[4] Jiangxi Univ Sci & Technol, Dept Sci, Ganzhou 341000, Jiangxi, Peoples R China

[5] Beijing Jiaotong Univ, State Key Lab Railway Traff Control & Safety, Beijing 100044, Peoples R China

来源：

AUTOMATION IN CONSTRUCTION | 2023年 / 155卷

基金：

中国国家自然科学基金;

关键词：

Railway safety; Semantic segmentation; Real-time; Railway track region segmentation; Cross-fusion decoder;

D O I：

10.1016/j.autcon.2023.105069

中图分类号：

TU [建筑科学];

学科分类号：

0813 ;

摘要：

To segment railway track regions in real-time for intrusion detection and improving security, this paper proposes an efficient railway track region segmentation network (ERTNet) based on the encoder-decoder architecture. Firstly, to ensure the lightweight of the encoder, depthwise convolution and the channel shuffle are utilized to construct sandglass-type feature extraction unit. Secondly, a feature-matching-based cross-fusion decoder is utilized to fuse deep and shallow feature maps. Thirdly, the knowledge distillation is employed with large-scale Deeplab v3+ as the teacher model to improve performance. Additionally, a loss function is proposed to penalize pixel points with large offsets. Finally, the ERTNet is validated on the self-built dataset, achieving an MIoU (Mean Intersection over Union) of 92.4% , which is 5.22% improvement over the benchmark model. ERTNet achieves a balance between segmentation accuracy and computational efficiency, requiring only 0.5 M parameters and 0.92 G FLOPs (Floating Point Operations).

引用

页数：13

共 50 条

[21] Efficient attention-based deep encoder and decoder for automatic crack segmentation [J].

Kang, Dong H. ;

Cha, Young-Jin .

STRUCTURAL HEALTH MONITORING-AN INTERNATIONAL JOURNAL, 2022, 21 (05) :2190-2205

[22] Intelligent Deployment Solution for Tabling Adapting Deep Learning [J].

Keshun, You ;

Huizhong, Liu .

IEEE ACCESS, 2023, 11 :22201-22208

[23]

Le Saux B, 2018, INT GEOSCI REMOTE SE, P4819, DOI 10.1109/IGARSS.2018.8517865

[24]

Liu ZH, 2021, ADV NEUR IN, V34

[25]

Long J, 2015, PROC CVPR IEEE, P3431, DOI 10.1109/CVPR.2015.7298965

[26] Multiscale edge detection based on Gaussian smoothing and edge tracking [J].

Lopez-Molina, C. ;

De Baets, B. ;

Bustince, H. ;

Sanz, J. ;

Barrenechea, E. .

KNOWLEDGE-BASED SYSTEMS, 2013, 44 :101-111

[27] ShuffleNet V2: Practical Guidelines for Efficient CNN Architecture Design [J].

Ma, Ningning ;

Zhang, Xiangyu ;

Zheng, Hai-Tao ;

Sun, Jian .

COMPUTER VISION - ECCV 2018, PT XIV, 2018, 11218 :122-138

[28] Deep learning-based active noise control on construction sites [J].

Mostafavi, Alireza ;

Cha, Young-Jin .

AUTOMATION IN CONSTRUCTION, 2023, 151

[29] Vision Transformers for Dense Prediction [J].

Ranftl, Rene ;

Bochkovskiy, Alexey ;

Koltun, Vladlen .

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :12159-12168

[30]

Reddy AS, 2018, PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON COMMUNICATION AND ELECTRONICS SYSTEMS (ICCES 2018), P229, DOI 10.1109/CESYS.2018.8723981

← 1 2 3 4 5 →