PSO-based optimized CNN for Hindi ASR

被引：23

作者：

Passricha, Vishal ^{[1
]}

Aggarwal, Rajesh Kumar ^{[1
]}

机构：

[1] Natl Inst Technol, Kurukshetra, Haryana, India

来源：

INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY | 2019年 / 22卷 / 04期

关键词：

CNN; Hyperparameter selection; PSO; Optimization; CONVOLUTIONAL NEURAL-NETWORKS; SPEECH;

D O I：

10.1007/s10772-019-09652-3

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Convolutional Neural Network (CNN) is one of the successful deep learning algorithms that have shown its effectiveness in a variety of vision tasks. The performance of this network depends directly on its hyperparameters. Although, designing CNN architectures require expert knowledge of their intrinsic structure or a lot of trial and error. To overcome these issues, there is a need to automatically design the optimal architecture of CNNs without any human intervention. So, we try to eliminate the constraints on the number of convolutional layers and pooling layers and their type etc. from traditional architecture. Biologically inspired approaches have not been extensively exploited for this task. This paper attempts to automatically optimize CNN architecture's hyperparameters for speech recognition task based on particle swarm optimization (PSO) which is a population based stochastic optimization technique. The proposed method is evaluated by designing CNN architecture for speech recognition task on Hindi dataset. The experimental results show that the proposed method significantly designs the competitive CNN architecture which performs similar as other state-of-the-art methods.

引用

页码：1123 / 1133

页数：11

共 40 条

[1] Convolutional Neural Networks for Speech Recognition [J].