Compressing Deep Model With Pruning and Tucker Decomposition for Smart Embedded Systems

Cited by: 18
Authors
Dai, Cheng [1]
Liu, Xingang [2]
Cheng, Hongqiang [2]
Yang, Laurence T. [3]
Deen, M. Jamal [4]
Affiliations
[1] Sichuan Univ, Coll Comp Sci, Chengdu 610017, Peoples R China
[2] Univ Elect Sci & Technol China, Sch Informat & Commun Engn, Chengdu 611731, Peoples R China
[3] St Francis Xavier Univ, Dept Comp Sci, Antigonish, NS B2G 2W5, Canada
[4] McMaster Univ, Dept Elect Engn & Comp Sci, Hamilton, ON L8S 4K1, Canada
Funding
National Natural Science Foundation of China
Keywords
Computational modeling; Deep learning; Bayes methods; Internet of Things (IoT); Data models; Streaming media; Edge computing; Deep model compression; parameter pruning; smart embedded systems; Tucker decomposition (TD)
DOI
10.1109/JIOT.2021.3116316
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Deep learning has proven to be one of the most effective methods for feature encoding in intelligent applications such as video-based human action recognition. However, its nonconvex optimization mechanism leads to large memory consumption, which hinders its deployment on smart embedded systems with limited computational resources. To overcome this challenge, we propose a novel deep model compression technique for smart embedded systems that reduces both memory size and inference complexity with only a small drop in accuracy. First, we propose an improved naive Bayes inference-based channel parameter pruning method to obtain a sparse model with higher accuracy. Then, to improve inference efficiency, we propose an improved Tucker decomposition method in which an improved genetic algorithm is used to optimize the Tucker ranks. Finally, extensive experiments are conducted to evaluate the effectiveness of the proposed method. The experimental results show that our method achieves state-of-the-art performance compared with existing methods in terms of accuracy, parameter compression, and floating-point operations reduction.
Pages: 14490-14500
Number of pages: 11