Discriminative neural network pruning in a multiclass environment: A case study in spoken emotion recognition

Cited by: 7

Authors:

Sanchez-Gutierrez, Maximo E. [1]

Gonzalez-Perez, Pedro P. [2]

Affiliations:

[1] Univ Politecn Penjamo, El Derramadero, Mexico

[2] Univ Autonoma Metropolitana Cuajimalpa, Mexico City, DF, Mexico

Source:

SPEECH COMMUNICATION | 2020 / Vol. 120

Keywords:

Restricted Boltzmann machines; Pruning; Discriminative information; Deep learning; Emotion recognition; DEEP; ALGORITHM; CLASSIFICATION;

DOI:

10.1016/j.specom.2020.03.006

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification code:

070206 ; 082403 ;

Abstract:

Deep learning has become one of the most widely accepted paradigms in machine learning. It focuses on the use of hierarchical data models and builds upon the notion that, in order to learn high-level data representations, a better understanding of intermediate-level representations is needed. Restricted Boltzmann machines and deep belief networks are two main types of deep learning algorithms commonly used in a wide array of classification and pattern recognition tasks. Examples of these tasks include natural language recognition, neuroimaging studies, time series forecasting, parametric voice synthesis, and speech emotion recognition, among others. Recent machine learning studies suggest that deep learning networks can help map feature problems into a more advantageous position, hence improving the classification process. However, selecting a suitable deep learning architecture for a specific problem can be difficult. In this study, we investigate whether discriminative measures, such as ANOVA, Pearson's correlation, Fisher score, gain ratio, ReliefF, and OneR, among others, could offer pointers to identify useful neural nodes in a deep learning network. The motivation is that normally not all hidden neurons provide insightful information for a classification task. Our approach consists of using some of these discriminative measures to rank the hidden neurons based on their output values and then pruning them according to their position within that ranking. Our results indicate that this approach is also helpful in multiclass classification problems and that the pruning process seems to have a positive effect in diminishing the resulting error rate.
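The rank-then-prune idea described in the abstract can be illustrated with a minimal sketch. It assumes the Fisher score as the discriminative measure (one of several the abstract names) and treats each column of a small activation matrix as the output of one hidden neuron. The function names, the toy activations, and the emotion labels are hypothetical and are not taken from the paper; the sketch only selects activation columns and does not modify or retrain any network.

```python
from collections import defaultdict


def fisher_scores(activations, labels):
    """Multiclass Fisher score for each hidden unit (one column per unit).

    Score = between-class scatter / within-class scatter; higher values
    mean the unit's outputs separate the classes better.
    """
    n_units = len(activations[0])
    by_class = defaultdict(list)
    for row, label in zip(activations, labels):
        by_class[label].append(row)

    scores = []
    for j in range(n_units):
        column = [row[j] for row in activations]
        overall_mean = sum(column) / len(column)
        between = 0.0
        within = 0.0
        for rows in by_class.values():
            values = [row[j] for row in rows]
            class_mean = sum(values) / len(values)
            between += len(values) * (class_mean - overall_mean) ** 2
            within += sum((v - class_mean) ** 2 for v in values)
        if within > 0:
            scores.append(between / within)
        else:
            # No within-class spread: perfectly separating or constant unit.
            scores.append(float("inf") if between > 0 else 0.0)
    return scores


def units_to_keep(scores, keep):
    """Indices of the `keep` highest-scoring units, in original order."""
    ranked = sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)
    return sorted(ranked[:keep])


def prune(activations, kept):
    """Drop the columns (hidden units) that are not in `kept`."""
    return [[row[j] for j in kept] for row in activations]


# Hypothetical toy data: 6 utterances, 3 hidden units, 3 emotion classes.
activations = [
    [0.9, 0.5, 0.2],
    [0.8, 0.4, 0.3],
    [0.1, 0.5, 0.6],
    [0.2, 0.4, 0.5],
    [0.5, 0.5, 0.9],
    [0.5, 0.4, 0.8],
]
labels = ["anger", "anger", "sadness", "sadness", "joy", "joy"]

scores = fisher_scores(activations, labels)
kept = units_to_keep(scores, keep=2)
pruned = prune(activations, kept)
```

In this toy setup the second unit behaves the same way in every class, so it receives the lowest score and is the one pruned. In practice the ranking measure and the number of units to keep would be chosen by validation, which this sketch does not cover.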

Pages: 20-30

Page count: 11
