Extractive Knowledge Distillation

Cited by: 4
Author
Kobayashi, Takumi [1]
Affiliation
[1] Natl Inst Adv Ind Sci & Technol, 1-1-1 Umezono, Tsukuba, Ibaraki, Japan
Source
2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022) | 2022
DOI
10.1109/WACV51458.2022.00142
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Knowledge distillation (KD) transfers knowledge of a teacher model to improve performance of a student model which is usually equipped with lower capacity. In the KD framework, however, it is unclear what kind of knowledge is effective and how it is transferred. This paper analyzes a KD process to explore the key factors. In a KD formulation, softmax temperature entangles three main components of student and teacher probabilities and a weight for KD, making it hard to analyze contributions of those factors separately. We disentangle those components so as to further analyze especially the temperature and improve the components respectively. Based on the analysis about temperature and uniformity of the teacher probability, we propose a method, called extractive distillation, for extracting effective knowledge from the teacher model. The extractive KD touches only teacher knowledge, thus being applicable to various KD methods. In the experiments on image classification tasks using CIFAR-100 and TinyImageNet datasets, we demonstrate that the proposed method outperforms the other KD methods and analyze feature representation to show its effectiveness in the framework of transfer learning.
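For context, the entanglement the abstract mentions can be seen in the conventional temperature-scaled KD term. The sketch below is a minimal illustration of that widely used baseline formulation, not the extractive method proposed in the paper; the function names and the default temperature of 4.0 are assumptions chosen for illustration. A single temperature value appears in three places: the student softmax, the teacher softmax, and the T^2 weight on the KD term.

```python
import math


def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax over a list of logits."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def kd_loss(student_logits, teacher_logits, temperature=4.0):
    """Conventional KD term: T^2 * KL(teacher || student).

    The same temperature softens both distributions and also
    sets the weight of the term (illustrative baseline only).
    """
    p_teacher = softmax(teacher_logits, temperature)
    p_student = softmax(student_logits, temperature)
    kl = sum(
        pt * math.log(pt / ps)
        for pt, ps in zip(p_teacher, p_student)
        if pt > 0.0
    )
    return temperature ** 2 * kl


if __name__ == "__main__":
    teacher = [5.0, 1.0, 0.5]  # hypothetical logits
    student = [2.0, 1.5, 0.5]
    print(softmax(teacher, 1.0))
    print(softmax(teacher, 4.0))  # flatter, closer to uniform
    print(kd_loss(student, teacher))
```

Raising the temperature pushes the teacher probabilities toward uniform, which is the property the abstract singles out for analysis.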
Pages: 1350-1359
Number of pages: 10