Automated Knowledge Distillation via Monte Carlo Tree Search

Cited by: 14

Authors:

Li, Lujun [1]

Dong, Peijie [2]

Wei, Zimian [2]

Yang, Ya [3]

Affiliations:

[1] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China

[2] Natl Univ Def Technol, Changsha, Peoples R China

[3] City Univ Hong Kong, Hong Kong, Peoples R China

Source:

2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023) | 2023

DOI:

10.1109/ICCV51070.2023.01597

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In this paper, we present Auto-KD, the first automated search framework for optimal knowledge distillation design. Traditional distillation techniques typically require handcrafted designs by experts and extensive tuning costs for different teacher-student pairs. To address these issues, we empirically study different distillers, finding that they can be decomposed, combined, and simplified. Based on these observations, we build our uniform search space with advanced operations in transformation, distance function, and hyperparameter components. For instance, the transformation parts are optional for global, intra-spatial, and inter-spatial operations, such as attention, mask, and multi-scale. Then, we introduce an effective search strategy based on the Monte Carlo tree search, modeling the search space as a Monte Carlo Tree (MCT) to capture the dependency among options. The MCT is updated using the test loss and representation gap of the student trained by candidate distillers as the reward for better exploration-exploitation balance. To accelerate the search process, we exploit offline processing without teacher inference, sparse training for the student, and proxy settings based on distillation properties. In this way, our Auto-KD needs only a small cost to search for optimal distillers before the distillation phase. Moreover, we expand Auto-KD for multi-layer and multi-teacher scenarios with training-free weighted factors. Our method is promising yet practical, and extensive experiments demonstrate that it generalizes well to different CNNs and Vision Transformer models and attains state-of-the-art performance across a range of vision tasks, including image classification, object detection, and semantic segmentation. Code is provided at https://github.com/lilujunai/Auto-KD.
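
The search strategy described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the three-level search space, the option names, the `Node` class, and `toy_reward` are all hypothetical stand-ins. In particular, the abstract states that the reward combines the test loss and the representation gap of a trained student, whereas the sketch below uses a fixed lookup table so that it runs without any training. It uses standard UCT selection, which may differ from the selection rule used in the paper.

```python
import math
import random

# Hypothetical, heavily simplified search space: one decision per tree level.
SEARCH_SPACE = [
    ("transform", ["none", "attention", "mask", "multi_scale"]),
    ("distance", ["l1", "l2", "kl"]),
    ("loss_weight", [0.5, 1.0, 2.0]),
]


class Node:
    """One partial distiller configuration in the tree."""

    def __init__(self, depth=0, choice=None, parent=None):
        self.depth = depth          # number of decisions already made
        self.choice = choice        # option picked to reach this node
        self.parent = parent
        self.children = []
        self.visits = 0
        self.total_reward = 0.0

    def is_complete(self):
        return self.depth == len(SEARCH_SPACE)

    def fully_expanded(self):
        return len(self.children) == len(SEARCH_SPACE[self.depth][1])


def uct_child(node, c=1.4):
    """Pick the child with the best mean reward plus exploration bonus."""
    return max(
        node.children,
        key=lambda ch: ch.total_reward / ch.visits
        + c * math.sqrt(math.log(node.visits) / ch.visits),
    )


def path_choices(node):
    """Collect the options chosen from the root down to this node."""
    choices = []
    while node.parent is not None:
        choices.append(node.choice)
        node = node.parent
    return choices[::-1]


def search(reward_fn, iterations=200, seed=0):
    rng = random.Random(seed)
    root = Node()
    best_reward, best_config = float("-inf"), None
    for _ in range(iterations):
        # Selection: descend while every option at this level has a child.
        node = root
        while not node.is_complete() and node.fully_expanded():
            node = uct_child(node)
        # Expansion: add one untried option.
        if not node.is_complete():
            tried = {ch.choice for ch in node.children}
            options = [o for o in SEARCH_SPACE[node.depth][1] if o not in tried]
            child = Node(node.depth + 1, rng.choice(options), node)
            node.children.append(child)
            node = child
        # Simulation: fill the remaining decisions at random and score them.
        config = path_choices(node)
        for _, options in SEARCH_SPACE[len(config):]:
            config.append(rng.choice(options))
        reward = reward_fn(config)
        if reward > best_reward:
            best_reward, best_config = reward, config
        # Backpropagation: update statistics along the selected path.
        while node is not None:
            node.visits += 1
            node.total_reward += reward
            node = node.parent
    names = [name for name, _ in SEARCH_SPACE]
    return dict(zip(names, best_config)), best_reward


def toy_reward(config):
    """Stand-in for 'train a student with this distiller and evaluate it'."""
    transform, distance, weight = config
    score = {"none": 0.0, "attention": 0.3, "mask": 0.5, "multi_scale": 0.2}[transform]
    score += {"l1": 0.1, "l2": 0.3, "kl": 0.2}[distance]
    return score - 0.2 * abs(weight - 1.0)
```

Calling `search(toy_reward)` returns the best configuration seen as a dictionary keyed by level name, together with its reward. In a real setting the reward function would be the expensive part, which is why the abstract emphasizes proxy settings and other acceleration techniques.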

Citation

Pages: 17367-17378

Number of pages: 12

50 items in total

[11] Monte Carlo Tree Search in Hex
Arneson, Broderick
Hayward, Ryan B.
Henderson, Philip
IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES, 2010, 2 (04) : 251 - 258
[12] Monte Carlo tree search in Kriegspiel
Ciancarini, Paolo
Favini, Gian Piero
ARTIFICIAL INTELLIGENCE, 2010, 174 (11) : 670 - 684
[13] MONTE CARLO TREE SEARCH: A TUTORIAL
Fu, Michael C.
2018 WINTER SIMULATION CONFERENCE (WSC), 2018, : 222 - 236
[14] Monte Carlo Tree Search for Quoridor
Respall, Victor Massague
Brown, Joseph Alexander
Aslam, Hamna
19TH INTERNATIONAL CONFERENCE ON INTELLIGENT GAMES AND SIMULATION (GAME-ON(R) 2018), 2018, : 5 - 9
[15] An Analysis of Monte Carlo Tree Search
James, Steven
Konidaris, George
Rosman, Benjamin
THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3576 - 3582
[16] Decentralized Cooperative Planning for Automated Vehicles with Continuous Monte Carlo Tree Search
Kurzer, Karl
Engelhorn, Florian
Zoellner, J. Marius
2018 21ST INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2018, : 452 - 459
[17] Decentralized Cooperative Planning for Automated Vehicles with Hierarchical Monte Carlo Tree Search
Kurzer, Karl
Zhou, Chenyang
Zoellner, J. Marius
2018 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2018, : 529 - 536
[18] Vine copula structure learning via Monte Carlo tree search
Chang, Bo
Pan, Shenyi
Joe, Harry
22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89 : 353 - 361
[19] Scalable Safe Policy Improvement via Monte Carlo Tree Search
Castellini, Alberto
Bianchi, Federico
Zorzi, Edoardo
Simao, Thiago D.
Farinelli, Alessandro
Spaan, Matthijs T. J.
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202
[20] Hedging of financial derivative contracts via Monte Carlo tree search
Szehr, Oleg
JOURNAL OF COMPUTATIONAL FINANCE, 2023, 27 (02) : 47 - 80