QueryNet: Querying neural networks for lightweight specialized models

被引:5
作者
Jin, Yeong-Hwa [1 ]
Lee, Keon-Ho [1 ]
Choi, Dong-Wan [1 ]
机构
[1] Inha Univ, Dept Comp Sci & Engn, Incheon, South Korea
基金
新加坡国家研究基金会;
关键词
Querying Neural Networks; Model Specialization; Lightweight Models;
D O I
10.1016/j.ins.2021.12.097
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Despite the active research on deep learning these days, no existing works attempt to query neural networks for a specific task so that we can quickly obtain a model specialized for the queried task. This paper presents the first study on the problem of querying neural networks, aiming to efficiently find a lightweight model for any on-demand sub-task supported by a large and generic neural network. This problem is well motivated by the fact that such a lightweight model is particularly suitable for deployment in commodity mobile devices with limited computing resources. In this paper, we propose QueryNet, a framework of queriable neural networks, which is based on our class-aware channel pruning technique and the proposed method of merging multiple tiny networks for producing a specialized model for the task. Through the extensive experiments, we show that QueryNet can generate a lightweight neural network for a given task up to 3.25 times faster than learning a specialized model from scratch, and the resulting specialized neural network carries up to 37 times less parameters than a pretrained network, and even shows a slightly better accuracy than that of the pretrained model. (C) 2021 Elsevier Inc. All rights reserved.
引用
收藏
页码:186 / 198
页数:13
相关论文
共 35 条
[1]  
[Anonymous], mixed quantum states with deep neural net
[2]   Pushing the AI Envelope [J].
Bazrafkan, Shabab ;
Corcoran, Peter .
IEEE CONSUMER ELECTRONICS MAGAZINE, 2018, 7 (02) :55-61
[3]   Block change learning for knowledge distillation [J].
Choi, Hyunguk ;
Lee, Younkwan ;
Yow, Kin Choong ;
Jeon, Moongu .
INFORMATION SCIENCES, 2020, 513 :360-371
[4]  
Chou YM, 2018, PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P2049
[5]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[6]   Pruning Deep Convolutional Neural Networks Architectures with Evolution Strategy [J].
Fernandes, Francisco E., Jr. ;
Yen, Gary G. .
INFORMATION SCIENCES, 2021, 552 :29-47
[7]   Pruning of generative adversarial neural networks for medical imaging diagnostics with evolution strategy [J].
Fernandes Jr, Francisco Erivaldo ;
Yen, Gary G. .
INFORMATION SCIENCES, 2021, 558 :91-102
[8]  
Fischer M, 2019, PR MACH LEARN RES, V97
[9]  
Han S., 2016, INT C LEARNING REPRE
[10]  
Han S, 2016, Harvard Yenching Ins, V101, P123