Effective Bi-decoding networks for rail-surface defect detection by knowledge distillation

被引:0
作者
Zhou, Wujie [1 ,2 ]
Wu, Yue [1 ]
Qiu, Weiwei [1 ]
Xu, Caie [1 ]
Qiang, Fangfang [1 ]
机构
[1] Zhejiang Univ Sci & Technol, Sch Informat & Elect Engn, Hangzhou 310023, Peoples R China
[2] Nanyang Technol Univ, Sch Comp Sci & Engn, Reading 308232, Singapore
基金
中国国家自然科学基金;
关键词
Bi-decoding layer; Expansion attention; Graph convolution; Knowledge distillation; Rail defect detection; Transformer;
D O I
10.1016/j.asoc.2024.112422
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
No-service rail-surface defect detection is a crucial method for assessing the quality of railroad tracks. However, the low-contrast and dark-tone characteristics of track-surface textures pose challenges to current defectmonitoring techniques. Real-time and on-site online inspections are important to ensure safe railway operation; however, most complex models for no-service inspections are difficult to deploy on mobile devices. To address these challenges and overcome the detection difficulties associated with complex scenes, we designed a knowledge distillation-based double decoding-layer refinement network (EBDNet-KD). The first decoding process is guided by a bimodal high-level semantic feature map obtained by extending the attention-based graph convolution to incrementally enhance the dual-stream features and obtain an image restoration prior. A divideand-conquer decoder is then designed to distinguish features using different decoding layers. The prior is then used in the second decoding layer, which enables the bimodal features to interact fully and obtain the final prediction map. We introduce a knowledge distillation strategy that enables a lightweight, compact student network to learn a complex teacher network's feature extraction process. This facilitates pixel-consistent learning of the knowledge within the bi-decoder layer, as well as bidirectional learning of the focused contextual response knowledge to optimize the model. The EBDNet-KD significantly reduces computational costs while guaranteeing performance with a parameter count of only 28 M. EBDNet-KD demonstrated superior performance over 15 stateof-the-art methods in experiments conducted on NEU RSDDS-AUG, an industrial RGB-depth dataset. We assessed the generalizability of EBDNet-KD by evaluating its performance on three additional public datasets, yielding competitive results. The source code and results can be found at https://github.com/Wuyue15/EBDNet.
引用
收藏
页数:14
相关论文
共 70 条
[1]   Fuzzy PD-sliding mode control design for networked system with time delays [J].
Aslam, Muhammad Shamrooz ;
Shamrooz, Summera ;
Bilal, Hazrat .
EUROPEAN JOURNAL OF CONTROL, 2024, 78
[2]   Modeling of nonlinear supply chain management with lead-times based on Takagi-Sugeno fuzzy control model [J].
Aslam, Muhammad Shamrooz ;
Bilal, Hazrat ;
Band, Shahab S. ;
Ghasemi, Peiman .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
[3]   Lqr-based PID controller with variable load tuned with data-driven methods for double inverted pendulum [J].
Aslam, Muhammad Shamrooz ;
Bilal, Hazrat ;
Hayajneh, Mohammad .
SOFT COMPUTING, 2024, 28 (01) :325-338
[4]   Semantic segmentation based on Deep learning for the detection of Cyanobacterial Harmful Algal Blooms (CyanoHABs) using synthetic images [J].
Barrientos-Espillco, Fredy ;
Gasco, Esther ;
Lopez-Gonzalez, Clara I. ;
Gomez-Silva, Maria J. ;
Pajares, Gonzalo .
APPLIED SOFT COMPUTING, 2023, 141
[5]   A practical study of active disturbance rejection control for rotary flexible joint robot manipulator [J].
Bilal, Hazrat ;
Yin, Baoqun ;
Aslam, Muhammad Shamrooz ;
Anjum, Zeeshan ;
Rohra, Avinash ;
Wang, Yizhen .
SOFT COMPUTING, 2023, 27 (08) :4987-5001
[6]   Jerk-bounded trajectory planning for rotary flexible joint manipulator: an experimental approach [J].
Bilal, Hazrat ;
Yin, Baoqun ;
Kumar, Aakash ;
Ali, Munawar ;
Zhang, Jing ;
Yao, Jinfa .
SOFT COMPUTING, 2023, 27 (07) :4029-4039
[7]   A Sound-Based Fault Diagnosis Method for Railway Point Machines Based on Two-Stage Feature Selection Strategy and Ensemble Classifier [J].
Cao, Yuan ;
Sun, Yongkui ;
Xie, Guo ;
Li, Peng .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (08) :12074-12083
[8]   Knowledge Distillation with the Reused Teacher Classifier [J].
Chen, Defang ;
Mei, Jian-Ping ;
Zhang, Hailin ;
Wang, Can ;
Feng, Yan ;
Chen, Chun .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, :11923-11932
[9]  
Chen GB, 2017, ADV NEUR IN, V30
[10]   CIR-Net: Cross-Modality Interaction and Refinement for RGB-D Salient Object Detection [J].
Cong, Runmin ;
Lin, Qinwei ;
Zhang, Chen ;
Li, Chongyi ;
Cao, Xiaochun ;
Huang, Qingming ;
Zhao, Yao .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 :6800-6815