BAG-OF-FEATURES-BASED KNOWLEDGE DISTILLATION FOR LIGHTWEIGHT CONVOLUTIONAL NEURAL NETWORKS

被引:0
作者
Chariton, Alexandros [1 ]
Passalis, Nikolaos [1 ]
Tefas, Anastasios [1 ]
机构
[1] Aristotle Univ Thessaloniki, Dept Informat, Computat Intelligence & Deep Learning Grp, AIIA Lab, Thessaloniki, Greece
来源
2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP | 2022年
关键词
Knowledge Distillation; Bag-of-Features; Mutual Information; Convolutional Neural Networks;
D O I
10.1109/ICIP46576.2022.9897390
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Knowledge distillation enables us to transfer the knowledge from a large and complex neural network into a smaller and faster one. This allows for improving the accuracy of the smaller network. However, directly transferring the knowledge between enormous feature maps, as they are extracted from convolutional layers, is not straightforward. In this work, we propose an efficient mutual information-based approach for transferring the knowledge between feature maps extracted from different networks. The proposed method employs an efficient Neural Bag-of-Features formulation to estimate the joint and marginal probabilities and then optimizes the whole pipeline in an end-to-end manner. The effectiveness of the proposed method is demonstrated using a lightweight, fully convolutional neural network architecture, which aims toward high-resolution analysis and targets photonic neural network accelerators.
引用
收藏
页码:1541 / 1545
页数:5
相关论文
共 20 条
  • [1] Model Compression and Acceleration for Deep Neural Networks The principles, progress, and challenges
    Cheng, Yu
    Wang, Duo
    Zhou, Pan
    Zhang, Tao
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2018, 35 (01) : 126 - 136
  • [2] Cheng Yu, 2017, CoRR
  • [3] Glorot X, 2011, P 14 INT C ART INT S, V15, P315, DOI DOI 10.1002/ECS2.1832
  • [4] Greff K., 2016, IEEE transactions on neural networks and learning systems, V28, P2222, DOI [DOI 10.1109/TNNLS.2016.2582924, 10.1109/TNNLS.2016.2582924]
  • [5] Hinton G., 2015, ARXIV, V2
  • [6] Howard AG., 2017, ARXIV, DOI DOI 10.48550/ARXIV.1704.04861
  • [7] Searching for MobileNetV3
    Howard, Andrew
    Sandler, Mark
    Chu, Grace
    Chen, Liang-Chieh
    Chen, Bo
    Tan, Mingxing
    Wang, Weijun
    Zhu, Yukun
    Pang, Ruoming
    Vasudevan, Vijay
    Le, Quoc V.
    Adam, Hartwig
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 1314 - 1324
  • [8] Product Quantization for Nearest Neighbor Search
    Jegou, Herve
    Douze, Matthijs
    Schmid, Cordelia
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (01) : 117 - 128
  • [9] Kingma DP, 2014, ADV NEUR IN, V27
  • [10] 1D convolutional neural networks and applications: A survey
    Kiranyaz, Serkan
    Avci, Onur
    Abdeljaber, Osama
    Ince, Turker
    Gabbouj, Moncef
    Inman, Daniel J.
    [J]. MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2021, 151