FALF ConvNets: Fatuous auxiliary loss based filter-pruning for efficient deep CNNs

Cited by: 14

Authors:

Singh, Pravendra [1]

Kadi, Vinay Sameer Raja [2]

Namboodiri, Vinay P. [1]

Affiliations:

[1] Indian Inst Technol Kanpur, Dept Comp Sci & Engn, Kanpur, Uttar Pradesh, India

[2] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA

Source:

IMAGE AND VISION COMPUTING | 2020 / Vol. 93

Keywords:

Filter pruning; Model compression; Convolutional neural network; Image recognition; Deep learning;

DOI:

10.1016/j.imavis.2019.103857

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Obtaining efficient Convolutional Neural Networks (CNNs) is imperative to enable their application to a wide variety of tasks (classification, detection, etc.). While several methods have been proposed to solve this problem, we propose a novel strategy that is orthogonal to the strategies proposed so far. We hypothesize that if we add a fatuous auxiliary task to a network that aims to solve a semantic task such as classification or detection, the filters devoted to solving this frivolous task would not be relevant for solving the main task of concern. These filters could be pruned, and pruning them would not reduce performance on the original task. We demonstrate that this strategy is not only successful but in fact allows for improved performance on a variety of tasks such as object classification, detection, and action recognition. An interesting observation is that the task needs to be fatuous so that any semantically meaningful filters would not be relevant for solving it. We thoroughly evaluate our proposed approach on different architectures (LeNet, VGG-16, ResNet, Faster RCNN, SSD-512, C3D, and MobileNet V2) and datasets (MNIST, CIFAR, ImageNet, GTSDB, COCO, and UCF101) and demonstrate its generalizability through extensive experiments. Moreover, our compressed models can be used at run-time without requiring any special libraries or hardware. Our model compression method reduces the number of FLOPs by a factor of 6.03X and the GPU memory footprint by more than 17X for VGG-16, significantly outperforming other state-of-the-art filter pruning methods. We demonstrate the usability of our approach for 3D convolutions and various vision tasks such as object classification, object detection, and action recognition. © 2019 Elsevier B.V. All rights reserved.
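To put the reported reduction factor in perspective, the sketch below counts multiply-accumulate operations for a small stack of convolution layers before and after whole filters are removed, and converts a reduction factor into a percentage saved. It is only an illustration under stated assumptions: the layer sizes, the number of pruned filters, and the helper names (`conv_flops`, `reduction_factor`, `percent_saved`) are hypothetical and are not taken from the paper, and the sketch does not show how the method decides which filters to prune. Under this arithmetic, a 6.03X reduction corresponds to roughly 83.4% fewer FLOPs.

```python
def conv_flops(c_in, c_out, k, h_out, w_out):
    """Multiply-accumulate count of one 2D convolution layer (bias ignored)."""
    return c_in * c_out * k * k * h_out * w_out


def reduction_factor(flops_before, flops_after):
    """Factor by which the operation count shrinks, e.g. 2.0 means half."""
    return flops_before / flops_after


def percent_saved(factor):
    """Convert a reduction factor into the percentage of operations removed."""
    return 100.0 * (1.0 - 1.0 / factor)


# Hypothetical two-layer stack (assumption, not from the paper):
# 3 -> 64 -> 64 channels, 3x3 kernels, 32x32 output feature maps.
before = conv_flops(3, 64, 3, 32, 32) + conv_flops(64, 64, 3, 32, 32)

# Removing 32 filters from layer 1 also removes the matching 32 input
# channels of layer 2, so both layers become smaller dense convolutions.
after = conv_flops(3, 32, 3, 32, 32) + conv_flops(32, 64, 3, 32, 32)

factor = reduction_factor(before, after)
saved = percent_saved(factor)
```

Because entire filters are removed, the pruned layers remain ordinary dense convolutions with fewer channels, which is consistent with the abstract's statement that the compressed models need no special libraries or hardware.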


Pages: 14

