A method of knowledge distillation based on feature fusion and attention mechanism for complex traffic scenes

被引:9
作者
Li, Cui-jin [1 ,2 ]
Qu, Zhong [1 ]
Wang, Sheng-ye [1 ]
机构
[1] Chongqing Univ Posts & Telecommun, Coll Comp Sci & Technol, Chongqing 400065, Peoples R China
[2] Chongqing Inst Engn, Coll Elect Informat, Chongqing 400056, Peoples R China
基金
中国国家自然科学基金;
关键词
Object detection; Knowledge distillation; Attention mechanism; Feature fusion; Complex traffic scenes;
D O I
10.1016/j.engappai.2023.106533
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Object detectors based on deep learning can run smoothly on a terminal device in complex traffic scenes, and the model compression method has become a research hotspot. Considering student network single learning in the knowledge distillation algorithm, the dependence on loss function design leads to parameter sensitivity and other problems, we propose a new knowledge distillation method with second-order term attention mechanisms and feature fusion of adjacent layers. First, we build a knowledge distillation framework based on YOLOv5 and propose a new attention mechanism in the teacher network backbone to extract the hot map. Then, we combine the hot map features with the next level features through the fusion module. By fusing the useful information of the low convolution layer and the feature map of the high convolution layer to help the student network obtain the final prediction map. Finally, to improve the accuracy of small objects, we add a 160 x 160 detection head and use a transformer encoder block module to replace the convolution network of the head. Sufficient experimental results show that our method achieves state-of-the-art performance. The speed and number of parameters remain unchanged, but the average detection accuracy is 97.4% on the KITTI test set. On the Cityscapes test set, the average detection accuracy reaches 92.7%.
引用
收藏
页数:11
相关论文
共 41 条
[1]   Variational Information Distillation for Knowledge Transfer [J].
Ahn, Sungsoo ;
Hu, Shell Xu ;
Damianou, Andreas ;
Lawrence, Neil D. ;
Dai, Zhenwen .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :9155-9163
[2]   STDnet: Exploiting high resolution feature maps for small object detection [J].
Bosquet, Brais ;
Mucientes, Manuel ;
Brea, Victor M. .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2020, 91
[3]   Data-free Knowledge Distillation for Object Detection [J].
Chawla, Akshay ;
Yin, Hongxu ;
Molchanov, Pavlo ;
Alvarez, Jose .
2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, :3288-3297
[4]  
Chen P.G, 2021, DISTILLING KNOWLEDGE, P1, DOI [10.48550/ArXiv.2104.09044, DOI 10.48550/ARXIV.2104.09044]
[5]   An improved Hoeffding's inequality for sum of independent random variables [J].
Cheng, Xueqin ;
Li, Yanpeng .
STATISTICS & PROBABILITY LETTERS, 2022, 183
[6]   The Cityscapes Dataset for Semantic Urban Scene Understanding [J].
Cordts, Marius ;
Omran, Mohamed ;
Ramos, Sebastian ;
Rehfeld, Timo ;
Enzweiler, Markus ;
Benenson, Rodrigo ;
Franke, Uwe ;
Roth, Stefan ;
Schiele, Bernt .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3213-3223
[7]   General Instance Distillation for Object Detection [J].
Dai, Xing ;
Jiang, Zeren ;
Wu, Zhao ;
Bao, Yiping ;
Wang, Zhicheng ;
Liu, Si ;
Zhou, Erjin .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :7838-7847
[8]   A lightweight vehicles detection network model based on YOLOv5 [J].
Dong, Xudong ;
Yan, Shuai ;
Duan, Chaoqun .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 113
[9]   Microstructure, mechanical and corrosion properties of Mg-Zn-Nd alloy with different accumulative area reduction after room-temperature drawing [J].
Gao, Ming ;
Ma, Zheng ;
Etim, Iniobong P. ;
Tan, Li-Li ;
Yang, Ke .
RARE METALS, 2021, 40 (04) :897-907
[10]   Res2Net: A New Multi-Scale Backbone Architecture [J].
Gao, Shang-Hua ;
Cheng, Ming-Ming ;
Zhao, Kai ;
Zhang, Xin-Yu ;
Yang, Ming-Hsuan ;
Torr, Philip .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (02) :652-662