Multilevel attention imitation knowledge distillation for RGB-thermal transmission line detection

被引:4
作者
Guo, Xiaodong [1 ]
Zhou, Wujie [2 ,3 ]
Liu, Tong [1 ]
机构
[1] Beijing Inst Technol, Sch Automat, Beijing 100081, Peoples R China
[2] Zhejiang Univ Sci & Technol, Sch Informat & Elect Engn, Hangzhou 310023, Peoples R China
[3] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 308232, Singapore
基金
中国国家自然科学基金;
关键词
Transmission line detection; Convolutional neural networks; Multi-modal; Knowledge distillation; SALIENT OBJECT DETECTION; NETWORK; FUSION;
D O I
10.1016/j.eswa.2024.125406
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Transmission line detection (TLD) plays a crucial role in ensuring the safety and stability of electricity supply. Applying RGB-thermal convolutional neural networks (CNNs) to unmanned aerial vehicles (UAVs) photography is a valuable alternative for diagnosing transmission line faults. However, existing CNNs struggle to generalize to TLD due to the clustered backgrounds and variable weather conditions. In addition, the limited computational resources and storage space of UAVs pose challenges for the lightweight design of models. In the present study, we developed a novel multilevel attention imitation knowledge distillation structure comprising a highperforming teacher model called MAINet-T and a compact student model called MAINet-S. We aimed to 1) improve the accuracy and robustness of TLD and 2) optimize the performance and capacity of the model for deployment on UAVs. The MAINet-T has a three-stage feature aggregation module and a detailed enhancement module to facilitate the processes of multi-modal and multilevel feature complement and interaction. To balance model performance and capacity for deployment, we proposed a novel KD strategy, including response distillation and feature distillation, to obtain an optimized model called MAINet-S*. Within feature distillation, we proposed a multilevel attention imitation module to integrate the advantages of the attention maps in different stages of the encoder. In experiments based on the VITLD dataset, MAINet-S* outperformed 15 state-of-the-art methods, with a 66.2% reduction in the number of weight parameters (Params) and a 69.9% increase in floating-point operations (FLOPs) compared with MAINet-T.
引用
收藏
页数:13
相关论文
共 56 条
[11]   Review of Visual Saliency Detection With Comprehensive Information [J].
Cong, Runmin ;
Lei, Jianjun ;
Fu, Huazhu ;
Cheng, Ming-Ming ;
Lin, Weisi ;
Huang, Qingming .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (10) :2941-2959
[12]   Structure-measure: A New Way to Evaluate Foreground Maps [J].
Fan, Deng-Ping ;
Cheng, Ming-Ming ;
Liu, Yun ;
Li, Tao ;
Borji, Ali .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :4558-4567
[13]   Efficient Parallel Branch Network With Multi-Scale Feature Fusion for Real-Time Overhead Power Line Segmentation [J].
Gao, Zishu ;
Yang, Guodong ;
Li, En ;
Liang, Zize ;
Guo, Rui .
IEEE SENSORS JOURNAL, 2021, 21 (10) :12220-12227
[14]  
Hinton G, 2015, Arxiv, DOI [arXiv:1503.02531, DOI 10.48550/ARXIV.1503.02531]
[15]   Fault Detection in Power Equipment via an Unmanned Aerial System Using Multi Modal Data [J].
Jalil, Bushra ;
Leone, Giuseppe Riccardo ;
Martinelli, Massimo ;
Moroni, Davide ;
Pascali, Maria Antonietta ;
Berton, Andrea .
SENSORS, 2019, 19 (13)
[16]   CDNet: Complementary Depth Network for RGB-D Salient Object Detection [J].
Jin, Wen-Da ;
Xu, Jun ;
Han, Qi ;
Zhang, Yi ;
Cheng, Ming-Ming .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 :3376-3390
[17]  
Ju R, 2014, IEEE IMAGE PROC, P1115, DOI 10.1109/ICIP.2014.7025222
[18]   Hierarchical Alternate Interaction Network for RGB-D Salient Object Detection [J].
Li, Gongyang ;
Liu, Zhi ;
Chen, Minyu ;
Bai, Zhen ;
Lin, Weisi ;
Ling, Haibin .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 :3528-3542
[19]   Power Line Detection by Pyramidal Patch Classification [J].
Li, Yan ;
Pan, Chaofeng ;
Cao, Xianbin ;
Wu, Dapeng .
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2019, 3 (06) :416-426
[20]   MEANet: An effective and lightweight solution for salient object detection in optical remote sensing images [J].
Liang, Bocheng ;
Luo, Huilan .
EXPERT SYSTEMS WITH APPLICATIONS, 2024, 238