CIMTD: Class Incremental Multi-Teacher Knowledge Distillation for Fractal Object Detection

被引:0
作者
Wu, Chuhan [1 ]
Luo, Xiaochuan [1 ,2 ]
Huang, Haoran [1 ]
Zhang, Yulin [3 ]
机构
[1] Northeastern Univ, Coll Informat Sci & Engn, Shenyang 110819, Peoples R China
[2] Northeastern Univ, State Key Lab Synthet Automat Proc Ind, Shenyang 110819, Peoples R China
[3] Univ Picardie Jules Verne, Amiens, France
来源
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT XII | 2025年 / 15042卷
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Fractal Object Detection; Multi-Teacher Knowledge Distillation; IGBT board;
D O I
10.1007/978-981-97-8858-3_4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In practical industrial vision inspection tasks, acquiring foreign object data proves challenging due to its scarcity and unique characteristics. Industrial foreign objects often exhibit fractal shapes, with rough or fragmented geometrical contours, making them difficult to discern. Given the continuous emergence of new industrial foreign objects on production lines, ensuring the accuracy and speed of detecting both base and novel foreign objects with a limited number of samples is imperative. To address this challenge, we propose an end-to-end multi-teacher knowledge distillation detection framework (CIMTD) based on YOLO. By incorporating the Wasserstein Distance of teacher confidence, the limitation of precise point-to-point matching inherent in KL divergence in traditional knowledge distillation methods has been mitigated. This approach enables students to systematically and logically grasp the knowledge imparted by teachers. Additionally, our design choice of multi-teacher padding instead of cropping allows the distillation network to address the class increment problem at the strategy level. Adaptive regression distillation empowers the model to autonomously determine whether to learn bounding box information from teachers based on discrepancies between teacher and student models, enhancing detection speed without compromising accuracy. Extensive experiments conducted on the IGBT surface foreign body dataset underscore the potential of our module design and strategy implementation, achieving 59.38% mAP, outperforming existing class incremental knowledge distillation methods. Our framework provides a more deployable industrial vision solution for edge-side devices with limited computational resources.
引用
收藏
页码:51 / 65
页数:15
相关论文
共 22 条
[1]   Rich feature hierarchies for accurate object detection and semantic segmentation [J].
Girshick, Ross ;
Donahue, Jeff ;
Darrell, Trevor ;
Malik, Jitendra .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :580-587
[2]   A Comprehensive Overhaul of Feature Distillation [J].
Heo, Byeongho ;
Kim, Jeesoo ;
Yun, Sangdoo ;
Park, Hyojin ;
Kwak, Nojun ;
Choi, Jin Young .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :1921-1930
[3]  
Hinton G, 2015, Arxiv, DOI arXiv:1503.02531
[4]   A Holistic Approach to IGBT Board Surface Fractal Object Detection Based on the Multi-Head Model [J].
Huang, Haoran ;
Luo, Xiaochuan .
MACHINES, 2022, 10 (08)
[5]  
Huang T, 2022, ADV NEUR IN
[6]   Surface Defect Detection Model for Aero-Engine Components Based on Improved YOLOv5 [J].
Li, Xin ;
Wang, Cheng ;
Ju, Haijuan ;
Li, Zhuoyue .
APPLIED SCIENCES-BASEL, 2022, 12 (14)
[7]   SSD: Single Shot MultiBox Detector [J].
Liu, Wei ;
Anguelov, Dragomir ;
Erhan, Dumitru ;
Szegedy, Christian ;
Reed, Scott ;
Fu, Cheng-Yang ;
Berg, Alexander C. .
COMPUTER VISION - ECCV 2016, PT I, 2016, 9905 :21-37
[8]   A supervised approach for automated surface defect detection in ceramic tile quality control [J].
Lu, Qinghua ;
Lin, Junmeng ;
Luo, Lufeng ;
Zhang, Yunzhi ;
Zhu, Wenbo .
ADVANCED ENGINEERING INFORMATICS, 2022, 53
[9]   THE DISTANCE BETWEEN 2 RANDOM VECTORS WITH GIVEN DISPERSION MATRICES [J].
OLKIN, I ;
PUKELSHEIM, F .
LINEAR ALGEBRA AND ITS APPLICATIONS, 1982, 48 (DEC) :257-263
[10]   Relational Knowledge Distillation [J].
Park, Wonpyo ;
Kim, Dongju ;
Lu, Yan ;
Cho, Minsu .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :3962-3971