Efficient railway track region segmentation algorithm based on lightweight neural network and cross-fusion decoder

Cited by: 32

Authors:

Chen, Zhichao [1,3]

Yang, Jie [1,2,3]

Chen, Lifang [4]

Feng, Zhicheng [1,3]

Jia, Limin [5]

Affiliations:

[1] Jiangxi Univ Sci & Technol, Dept Elect Engn & Automat, Ganzhou 341000, Jiangxi, Peoples R China

[2] Chinese Acad Sci, Ganjiang Innovat Acad, Ganzhou 341000, Jiangxi, Peoples R China

[3] Jiangxi Prov Key Lab Maglev Technol, Ganzhou 341000, Jiangxi, Peoples R China

[4] Jiangxi Univ Sci & Technol, Dept Sci, Ganzhou 341000, Jiangxi, Peoples R China

[5] Beijing Jiaotong Univ, State Key Lab Railway Traff Control & Safety, Beijing 100044, Peoples R China

Source:

AUTOMATION IN CONSTRUCTION | 2023 / Vol. 155

Funding:

National Natural Science Foundation of China;

Keywords:

Railway safety; Semantic segmentation; Real-time; Railway track region segmentation; Cross-fusion decoder;

DOI:

10.1016/j.autcon.2023.105069

Chinese Library Classification (CLC) number:

TU [Building Science];

Discipline classification code:

0813;

Abstract:

To segment railway track regions in real time for intrusion detection and improved security, this paper proposes an efficient railway track region segmentation network (ERTNet) based on the encoder-decoder architecture. First, to keep the encoder lightweight, depthwise convolution and channel shuffle are used to construct a sandglass-type feature extraction unit. Second, a feature-matching-based cross-fusion decoder fuses deep and shallow feature maps. Third, knowledge distillation is employed, with a large-scale DeepLab v3+ as the teacher model, to improve performance. Additionally, a loss function is proposed that penalizes pixels with large offsets. Finally, ERTNet is validated on a self-built dataset, achieving an MIoU (Mean Intersection over Union) of 92.4%, a 5.22% improvement over the benchmark model. ERTNet balances segmentation accuracy and computational efficiency, requiring only 0.5 M parameters and 0.92 G FLOPs (floating-point operations).
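For readers unfamiliar with the MIoU metric reported in the abstract, the following is a minimal sketch of how it is commonly computed for a label map. It is not the authors' evaluation code: the function name `mean_iou`, the two-class labelling (0 = background, 1 = track region), and the tiny example arrays are all assumptions made for illustration.

```python
def mean_iou(pred, target, num_classes):
    """Mean Intersection over Union over two flat lists of class labels."""
    ious = []
    for c in range(num_classes):
        # Pixels where both prediction and ground truth are class c.
        intersection = sum(1 for p, t in zip(pred, target) if p == c and t == c)
        # Pixels where either prediction or ground truth is class c.
        union = sum(1 for p, t in zip(pred, target) if p == c or t == c)
        if union > 0:  # skip classes absent from both prediction and ground truth
            ious.append(intersection / union)
    return sum(ious) / len(ious) if ious else 0.0


# Hypothetical 2x4 label map, flattened row by row (assumed labels:
# 0 = background, 1 = track region).
target = [0, 0, 1, 1, 0, 1, 1, 1]
pred = [0, 1, 1, 1, 0, 0, 1, 1]
print(round(mean_iou(pred, target, num_classes=2), 3))  # prints 0.583
```

In this toy case the background class has an IoU of 2/4 and the track class 4/6, so the mean is about 0.583. A figure such as the 92.4% in the abstract is obtained the same way, averaged per class over the whole test set.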


Pages: 13

