Automated Knowledge Distillation via Monte Carlo Tree Search

被引:14
|
作者
Li, Lujun [1 ]
Dong, Peijie [2 ]
Wei, Zimian [2 ]
Yang, Ya [3 ]
机构
[1] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
[2] Natl Univ Def Technol, Changsha, Peoples R China
[3] City Univ Hong Kong, Hong Kong, Peoples R China
来源
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023) | 2023年
关键词
D O I
10.1109/ICCV51070.2023.01597
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present Auto-KD, the first automated search framework for optimal knowledge distillation design. Traditional distillation techniques typically require handcrafted designs by experts and extensive tuning costs for different teacher-student pairs. To address these issues, we empirically study different distillers, finding that they can be decomposed, combined, and simplified. Based on these observations, we build our uniform search space with advanced operations in transformations, distance functions, and hyperparameters components. For instance, the transformation parts are optional for global, intra-spatial, and inter-spatial operations, such as attention, mask, and multi-scale. Then, we introduce an effective search strategy based on the Monte Carlo tree search, modeling the search space as a Monte Carlo Tree (MCT) to capture the dependency among options. The MCT is updated using test loss and representation gap of student trained by candidate distillers as the reward for better exploration-exploitation balance. To accelerate the search process, we exploit offline processing without teacher inference, sparse training for student, and proxy settings based on distillation properties. In this way, our Auto-KD only needs small costs to search for optimal distillers before the distillation phase. Moreover, we expand Auto-KD for multi-layer and multi-teacher scenarios with training-free weighted factors. Our method is promising yet practical, and extensive experiments demonstrate that it generalizes well to different CNNs and Vision Transformer models and attains state-of-the-art performance across a range of vision tasks, including image classification, object detection, and semantic segmentation. Code is provided at https://github.com/lilujunai/Auto-KD.
引用
收藏
页码:17367 / 17378
页数:12
相关论文
共 50 条
  • [21] Accelerating Cooperative Planning for Automated Vehicles with Learned Heuristics and Monte Carlo Tree Search
    Kurzer, Karl
    Fechner, Marcus
    Zoellner, J. Marius
    2020 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2020, : 1726 - 1733
  • [22] Automated conceptual design of mechanisms based on Thompson Sampling and Monte Carlo Tree Search
    Mao, Jiangmin
    Zhu, Yingdan
    Chen, Gang
    Yan, Chun
    Zhang, Wuxiang
    APPLIED SOFT COMPUTING, 2025, 170
  • [23] Optimal state space reconstruction via Monte Carlo decision tree search
    K. Hauke Kraemer
    Maximilian Gelbrecht
    Induja Pavithran
    R. I. Sujith
    Norbert Marwan
    Nonlinear Dynamics, 2022, 108 : 1525 - 1545
  • [24] Approximation Methods for Monte Carlo Tree Search
    Aksenov, Kirill
    Panov, Aleksandr, I
    PROCEEDINGS OF THE FOURTH INTERNATIONAL SCIENTIFIC CONFERENCE INTELLIGENT INFORMATION TECHNOLOGIES FOR INDUSTRY (IITI'19), 2020, 1156 : 68 - 74
  • [25] A TUTORIAL INTRODUCTION TO MONTE CARLO TREE SEARCH
    Fu, Michael C.
    2020 WINTER SIMULATION CONFERENCE (WSC), 2020, : 1178 - 1193
  • [26] Optimal state space reconstruction via Monte Carlo decision tree search
    Kraemer, K. Hauke
    Gelbrecht, Maximilian
    Pavithran, Induja
    Sujith, R.I.
    Marwan, Norbert
    Nonlinear Dynamics, 2022, 108 (02): : 1525 - 1545
  • [27] Monte-Carlo Tree Search for Logistics
    Edelkamp, Stefan
    Gath, Max
    Greulich, Christoph
    Humann, Malte
    Herzog, Otthein
    Lawo, Michael
    COMMERCIAL TRANSPORT, 2016, : 427 - 440
  • [28] LinUCB applied to Monte Carlo tree search
    Mandai, Yusaku
    Kaneko, Tomoyuki
    THEORETICAL COMPUTER SCIENCE, 2016, 644 : 114 - 126
  • [29] Monte Carlo Tree Search for Trading and Hedging
    Vittori, Edoardo
    Likmeta, Amarildo
    Restelli, Marcello
    ICAIF 2021: THE SECOND ACM INTERNATIONAL CONFERENCE ON AI IN FINANCE, 2021,
  • [30] A Survey of Monte Carlo Tree Search Methods
    Browne, Cameron B.
    Powley, Edward
    Whitehouse, Daniel
    Lucas, Simon M.
    Cowling, Peter I.
    Rohlfshagen, Philipp
    Tavener, Stephen
    Perez, Diego
    Samothrakis, Spyridon
    Colton, Simon
    IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES, 2012, 4 (01) : 1 - 43