Class-dependent Pruning of Deep Neural Networks

Cited by: 3
Authors
Entezari, Rahim [1]
Saukh, Olga [1]
Affiliation
[1] Graz Univ Technol, Inst Tech Informat, CSH Vienna, Graz, Austria
Source
2020 IEEE SECOND WORKSHOP ON MACHINE LEARNING ON EDGE IN SENSOR SYSTEMS (SENSYS-ML 2020) | 2020
Keywords
deep neural network compression; pruning; lottery ticket hypothesis; data imbalance; class imbalance;
DOI
10.1109/SenSysML50931.2020.00010
CLC Number
TP18 [Artificial Intelligence Theory];
Discipline Codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Today's deep neural networks require substantial computation resources for their training, storage, and inference, which limits their effective use on resource-constrained devices. Many recent research activities explore different options for compressing and optimizing deep models. On the one hand, many real-world applications face the data imbalance challenge, i.e., the number of labeled instances of one class considerably outweighs the number of labeled instances of another class. On the other hand, applications may pose a class imbalance problem, i.e., a higher number of false positives produced while training and optimizing a model may be tolerable, yet the number of false negatives must stay low. The problem originates from the fact that some classes are more important to the application than others, e.g., in detection problems in the medical and surveillance domains. Motivated by the success of the lottery ticket hypothesis, in this paper we propose an iterative deep model compression technique that keeps the number of false negatives of the compressed model close to that of the original model, at the price of increasing the number of false positives if necessary. Our experimental evaluation on two benchmark data sets shows that the resulting compressed sub-networks 1) achieve up to 35% fewer false negatives than a model compressed without class optimization, 2) provide an overall higher AUC-ROC measure than the conventional Lottery Ticket algorithm and three recent popular pruning methods, and 3) use up to 99% fewer parameters than the original network. The code is publicly available(1).
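The abstract describes iterative, lottery-ticket-style pruning with a class-dependent acceptance criterion. The sketch below illustrates one way such a loop could look; it is a minimal illustration under assumptions, not the authors' implementation: the names false_negatives, magnitude_prune, class_dependent_pruning, the train_fn retraining callback, the per-round pruning rate, and the slack factor on the false-negative budget are all hypothetical placeholders.

# Minimal sketch, assuming a PyTorch classifier. All names and the back-off
# rule are illustrative placeholders, not the method published in the paper.
import copy
import torch

def false_negatives(model, loader, critical_class, device="cpu"):
    """Count critical-class samples that the model fails to detect."""
    model.eval()
    fn = 0
    with torch.no_grad():
        for x, y in loader:
            preds = model(x.to(device)).argmax(dim=1).cpu()
            fn += ((y == critical_class) & (preds != critical_class)).sum().item()
    return fn

def magnitude_prune(model, masks, rate):
    """Zero out the smallest-magnitude surviving weights, layer by layer."""
    for name, param in model.named_parameters():
        if "weight" not in name:
            continue
        mask = masks.setdefault(name, torch.ones_like(param))
        alive = param.data[mask.bool()].abs()
        if alive.numel() == 0:
            continue
        threshold = alive.kthvalue(max(1, int(rate * alive.numel()))).values
        mask[param.data.abs() <= threshold] = 0.0
        param.data.mul_(mask)
    return masks

def class_dependent_pruning(model, init_state, train_fn, val_loader,
                            critical_class, rounds=10, rate=0.2, slack=1.05):
    """Iterative pruning that rejects a round if false negatives grow too much."""
    fn_baseline = false_negatives(model, val_loader, critical_class)
    masks = {}
    for _ in range(rounds):
        candidate = copy.deepcopy(model)
        candidate_masks = magnitude_prune(candidate, copy.deepcopy(masks), rate)
        candidate.load_state_dict(init_state)              # rewind to initialization
        for name, param in candidate.named_parameters():   # re-apply the pruning mask
            if name in candidate_masks:
                param.data.mul_(candidate_masks[name])
        train_fn(candidate)                                 # retrain the sub-network
        if false_negatives(candidate, val_loader, critical_class) <= slack * fn_baseline:
            model, masks = candidate, candidate_masks       # accept this round
        else:
            rate *= 0.5                                     # otherwise prune more gently
    return model, masks

The acceptance test mirrors the stated goal of keeping false negatives close to the unpruned baseline, trading them for false positives if needed; halving the pruning rate on rejection is one plausible back-off rule, not necessarily the one used in the paper.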
Pages: 13-18
Page count: 6