Distilled Center and Scale Prediction: Distilling anchor-free pedestrian detector for edge computing

被引:0
作者
Wang, Jianyuan [1 ]
She, Liang [2 ,3 ]
Wang, Wei [2 ]
Liu, Xinyue [4 ]
Zeng, Yangyan [5 ]
机构
[1] Univ Sci & Technol Beijing, Sch Intelligence Sci & Technol, Beijing 100183, Peoples R China
[2] Cent South Univ, Sch Comp Sci & Engn, Changsha 410083, Peoples R China
[3] Hunan Univ Technol & Business, Sch Comp Sci, Changsha 410205, Peoples R China
[4] Beihang Univ, Sch Comp Sci & Engn, Beijing 100191, Peoples R China
[5] Hunan Univ Technol & Business, Sch Frontier Crossover Studies, Changsha 410205, Peoples R China
基金
中国国家自然科学基金;
关键词
Pedestrian detection; Knowledge distillation; Internet of Things; Edge computing; NETWORK;
D O I
10.1016/j.iot.2024.101444
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As an important task of the Internet of Things, pedestrian detection has attained good detection results powered by deep learning. However, common pedestrian detectors based on deep learning usually require lots of computing resources and consume large amounts of energy, making them not suitable for edge devices in the edge computing paradigm. Therefore, for the pedestrian detection task in edge computing, we consider knowledge distillation on the anchor-free detector Center and Scale Prediction to ensure the pedestrian detection performance while reducing the parameter number and inference time of the detector as much as possible. We propose a distillation framework Distilled Center and Scale Prediction, in which we implement feature-based and response-based distillation for transferring knowledge from the larger model to the smaller model. In order to transmit information useful for detection in distillation as much as possible, Multi-Reference Distillation is designed to filter the transferred knowledge. Moreover, Cross-Module Distillation is proposed to enhance the transfer of relational information during distillation. We perform related experiments on the CityPersons dataset. Our proposed distilled detector achieves 10.45% MR-2 with ResNet18 as backbone and 10.27% MR-2 with ResNet50 as backbone, even outperforming the original teacher detectors. At the same time, the inference time per image is reduced by more than 10% compared to the original teacher detector.
引用
收藏
页数:15
相关论文
共 74 条
[41]  
Romero A, 2015, Arxiv, DOI arXiv:1412.6550
[42]   Development of edge computing and classification using The Internet of Things with incremental learning for object detection [J].
Shitharth, S. ;
Manoharan, Hariprasath ;
Alsowail, Rakan A. ;
Shankar, Achyut ;
Pandiaraj, Saravanan ;
Maple, Carsten ;
Jeon, Gwanggil .
INTERNET OF THINGS, 2023, 23
[43]  
Simonyan K, 2015, Arxiv, DOI [arXiv:1409.1556, DOI 10.48550/ARXIV.1409.1556, 10.3390/s21082852]
[44]   Small-Scale Pedestrian Detection Based on Topological Line Localization and Temporal Feature Aggregation [J].
Song, Tao ;
Sun, Leiyu ;
Xie, Di ;
Sun, Haiming ;
Pu, Shiliang .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :554-569
[45]   PRNet plus plus : Learning towards generalized occluded pedestrian detection via progressive refinement network [J].
Song, Xiaolin ;
Chen, Binghui ;
Li, Pengyu ;
Wang, Biao ;
Zhang, Honggang .
NEUROCOMPUTING, 2022, 482 :98-115
[46]  
Sun RY, 2020, Arxiv, DOI [arXiv:2006.13108, 10.48550/arXiv.2006.13108]
[47]  
Ullah Akram S., 2022, arXiv
[48]   Robust real-time face detection [J].
Viola, P ;
Jones, MJ .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2004, 57 (02) :137-154
[49]  
Wang C., 2024, IEEE Trans. Consum. Electron., early access, P1, DOI [10.1109/TCE.2024.3375859, DOI 10.1109/TCE.2024.3375859]
[50]   Distilling Object Detectors With Fine-Grained Feature Imitation [J].
Wang, Tao ;
Yuan, Li ;
Zhang, Xiaopeng ;
Feng, Jiashi .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :4928-4937