KRR-CNN: kernels redundancy reduction in convolutional neural networks

Cited by: 23

Authors:

Hssayni, El Houssaine ^{[1]}

Joudar, Nour-Eddine ^{[2]}

Ettaouil, Mohamed ^{[1]}

Affiliations:

[1] Sidi Mohamed Ben Abdellah Univ, Fac Sci & Technol Fez, Dept Math, Modelling & Math Struct Lab, Fes, Morocco

[2] Mohammed V Univ Rabat, ENSAM, Dept Appl Math & Informat, M2CS, Res Ctr STIS, Rabat, Morocco

Source:

NEURAL COMPUTING & APPLICATIONS | 2022 / Vol. 34 / Issue 03

Keywords:

Convolutional neural networks; Convolution kernel; Binary optimization; Genetic algorithm; Image classification; OPTIMIZATION; ALGORITHM;

DOI:

10.1007/s00521-021-06540-3

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Convolutional neural networks (CNNs) are a promising tool for solving real-world problems. However, successful CNNs often require a large number of parameters, which leads to significant memory use and a higher computational cost. This may produce undesirable phenomena, notably overfitting. Indeed, many kernels in a CNN are usually redundant and can be eliminated from the network while preserving its performance. In this work, we propose a new optimization model for kernels redundancy reduction in CNNs, named KRR-CNN. It consists of minimization and optimization phases. In the first, a dataset is used to train a specific CNN, generating a learned CNN with optimal parameters. These parameters are then combined with a decision optimization model to remove kernels that have not contributed to the first task. The optimization phase is carried out by an evolutionary genetic algorithm. The efficiency of KRR-CNN has been demonstrated in several experiments: the suggested model reduces kernel redundancy and improves classification performance, with results comparable to state-of-the-art CNNs.
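The abstract gives no formulas, so the following is only a minimal sketch of the general idea of choosing kernels with a binary decision vector optimized by a genetic algorithm. It is not the authors' model. The per-kernel `importance` scores, the `penalty` term, the `fitness` function, and all hyperparameters are hypothetical stand-ins chosen for illustration; in practice the objective would be derived from the trained CNN.

```python
import random


def fitness(mask, importance, penalty):
    """Hypothetical objective: importance retained by the kept kernels,
    minus a fixed cost for every kernel that is kept."""
    retained = sum(s for m, s in zip(mask, importance) if m)
    return retained - penalty * sum(mask)


def genetic_kernel_selection(importance, penalty=0.5, pop_size=30,
                             generations=60, mutation_rate=0.05, seed=0):
    """Search for a binary keep/drop mask with a simple genetic algorithm."""
    rng = random.Random(seed)
    n = len(importance)  # assumed to be at least 2

    def score(mask):
        return fitness(mask, importance, penalty)

    # Start from the unpruned network (all kernels kept) plus random masks.
    population = [[1] * n]
    population += [[rng.randint(0, 1) for _ in range(n)]
                   for _ in range(pop_size - 1)]
    best = max(population, key=score)

    for _ in range(generations):
        next_population = [best[:]]  # elitism: the best mask always survives
        while len(next_population) < pop_size:
            # Tournament selection of two parents.
            p1 = max(rng.sample(population, 3), key=score)
            p2 = max(rng.sample(population, 3), key=score)
            # One-point crossover.
            cut = rng.randint(1, n - 1)
            child = p1[:cut] + p2[cut:]
            # Bit-flip mutation.
            child = [1 - g if rng.random() < mutation_rate else g
                     for g in child]
            next_population.append(child)
        population = next_population
        best = max(population, key=score)
    return best


# Toy scores for eight kernels (made-up numbers, not data from the paper).
importance = [0.9, 0.1, 0.8, 0.05, 0.7, 0.2, 0.95, 0.15]
mask = genetic_kernel_selection(importance)
print(mask, fitness(mask, importance, 0.5))
```

A `1` in the returned mask means the kernel is kept and a `0` means it is removed. Because the all-ones mask is in the initial population and the best mask is carried over each generation, the result is never worse than the unpruned configuration under this toy objective.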


Pages: 2443-2454

Number of pages: 12
