Deep network compression based on partial least squares

被引：9

作者：

Jordao, Artur ^{[1
]}

Yamada, Fernando ^{[1
]}

Schwartz, William Robson ^{[1
]}

机构：

[1] Univ Fed Minas Gerais, Comp Sci Dept, Smart Sense Lab, Belo Horizonte, MG, Brazil

来源：

NEUROCOMPUTING | 2020年 / 406卷

关键词：

Convolutional networks compression; Partial least squares; Pruning convolutional networks;

D O I：

10.1016/j.neucom.2020.03.108

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Modern visual pattern recognition methods are based on convolutional networks since they are able to learn complex patterns directly from the data. However, convolutional networks are computationally expensive in terms of floating point operations (FLOPs), energy consumption and memory requirements, which hinder their deployment on low-power and resource-constrained systems. To address this problem, many works have proposed pruning strategies, which remove neurons (i.e., filters) in convolutional networks to reduce their computational cost. Despite achieving remarkable results, existing pruning approaches are ineffective since the accuracy of the network is degraded. This loss in accuracy is an effect of the criterion used to remove filters, as it may result in the removal of the filters with high influence to the classification ability of the network. Motivated by this, we propose an approach that eliminates filters based on the relationship of their outputs with the class label, on a low-dimensional space. This relationship is captured using Partial Least Squares (PLS), a discriminative feature projection method. Due to the nature of PLS, our method focuses on keeping discriminative filters. As a consequence, we are able to remove up to 60% of FLOPs while improving network accuracy. We show that our criterion is superior to existing pruning criteria, which include state-of-the-art feature selection techniques and handcrafted approaches. Compared to state-of-the-art pruning strategies, our method achieves the best tradeoff between drop/improvement in accuracy and FLOPs reduction. © 2020 Elsevier B.V.

引用

页码：234 / 243

页数：10

共 43 条

[1] Partial least squares regression and projection on latent structure regression (PLS Regression)
Abdi, Herve
[J]. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2010, 2 (01): : 97 - 106
[2] [Anonymous], 2017, ICLR 17
[3] [Anonymous], 2018, P BRIT MACH VIS C BM
[4] [Anonymous], 2018, P ECCV WORKSHOPS
[5] Brendel W., 2019, P INT C LEARN REPR I
[6] Bulat A., 2019, BMVC
[7] Cai H., 2019, P INT C LEARN REPR, P1
[8] VGGFace2: A dataset for recognising faces across pose and age
Cao, Qiong
Shen, Li
Xie, Weidi
Parkhi, Omkar M.
Zisserman, Andrew
[J]. PROCEEDINGS 2018 13TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE & GESTURE RECOGNITION (FG 2018), 2018, : 67 - 74
[9] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[10] Donahue J, 2014, PR MACH LEARN RES, V32

← 1 2 3 4 5 →