Automated Knowledge Distillation via Monte Carlo Tree Search

被引:14
|
作者
Li, Lujun [1 ]
Dong, Peijie [2 ]
Wei, Zimian [2 ]
Yang, Ya [3 ]
机构
[1] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
[2] Natl Univ Def Technol, Changsha, Peoples R China
[3] City Univ Hong Kong, Hong Kong, Peoples R China
来源
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023) | 2023年
关键词
D O I
10.1109/ICCV51070.2023.01597
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present Auto-KD, the first automated search framework for optimal knowledge distillation design. Traditional distillation techniques typically require handcrafted designs by experts and extensive tuning costs for different teacher-student pairs. To address these issues, we empirically study different distillers, finding that they can be decomposed, combined, and simplified. Based on these observations, we build our uniform search space with advanced operations in transformations, distance functions, and hyperparameters components. For instance, the transformation parts are optional for global, intra-spatial, and inter-spatial operations, such as attention, mask, and multi-scale. Then, we introduce an effective search strategy based on the Monte Carlo tree search, modeling the search space as a Monte Carlo Tree (MCT) to capture the dependency among options. The MCT is updated using test loss and representation gap of student trained by candidate distillers as the reward for better exploration-exploitation balance. To accelerate the search process, we exploit offline processing without teacher inference, sparse training for student, and proxy settings based on distillation properties. In this way, our Auto-KD only needs small costs to search for optimal distillers before the distillation phase. Moreover, we expand Auto-KD for multi-layer and multi-teacher scenarios with training-free weighted factors. Our method is promising yet practical, and extensive experiments demonstrate that it generalizes well to different CNNs and Vision Transformer models and attains state-of-the-art performance across a range of vision tasks, including image classification, object detection, and semantic segmentation. Code is provided at https://github.com/lilujunai/Auto-KD.
引用
收藏
页码:17367 / 17378
页数:12
相关论文
共 50 条
  • [1] Automated Machine Learning with Monte-Carlo Tree Search
    Rakotoarison, Herilalaina
    Schoenauer, Marc
    Sebag, Michele
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 3296 - 3303
  • [2] Automated Quantum Circuit Design With Nested Monte Carlo Tree Search
    Wang, Peiyong
    Usman, Muhammad
    Parampalli, Udaya
    Hollenberg, Lloyd C. L.
    Myers, Casey R.
    IEEE TRANSACTIONS ON QUANTUM ENGINEERING, 2023, 4
  • [3] Virtual Network Embedding via Monte Carlo Tree Search
    Haeri, Soroush
    Trajkovic, Ljiljana
    IEEE TRANSACTIONS ON CYBERNETICS, 2018, 48 (02) : 510 - 521
  • [4] Extracting Knowledge from Web Text with Monte Carlo Tree Search
    Liu, Guiliang
    Li, Xu
    Wang, Jiakang
    Sun, Mingming
    Li, Ping
    WEB CONFERENCE 2020: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2020), 2020, : 2585 - 2591
  • [5] Efficiency of Static Knowledge Bias in Monte-Carlo Tree Search
    Ikeda, Kokolo
    Viennot, Simon
    COMPUTERS AND GAMES, CG 2013, 2014, 8427 : 26 - 38
  • [6] Knowledge complement for Monte Carlo Tree Search: an application to combinatorial games
    Fabbri, Andre
    Armetta, Frederic
    Duchene, Eric
    Hassas, Salima
    2014 IEEE 26TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2014, : 997 - 1003
  • [7] Adding Expert Knowledge and Exploration in Monte-Carlo Tree Search
    Chaslot, Guillaume
    Fiter, Christophe
    Hoock, Jean-Baptiste
    Rimmel, Arpad
    Teytaud, Olivier
    ADVANCES IN COMPUTER GAMES, 2010, 6048 : 1 - +
  • [8] Multiagent Monte Carlo Tree Search
    Zerbel, Nicholas
    Yliniemi, Logan
    AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 2309 - 2311
  • [9] Monte Carlo Tree Search with Metaheuristics
    Mandziuk, Jacek
    Walczak, Patryk
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2023, PT II, 2023, 14126 : 134 - 144
  • [10] Elastic Monte Carlo Tree Search
    Xu, Linjie
    Dockhorn, Alexander
    Perez-Liebana, Diego
    IEEE TRANSACTIONS ON GAMES, 2023, 15 (04) : 527 - 537