Glowworm swarm based fuzzy classifier with dual features for speech emotion recognition

被引:1
作者
Rajasekhar, B. [1 ]
Kamaraju, M. [2 ]
Sumalatha, V [1 ]
机构
[1] Jawaharlal Nehru Technol Univ Ananthapur, Ananthapuramu, Andhra Pradesh, India
[2] Gudlavalleru Engn Coll, Gudlavalleru, Andhra Pradesh, India
关键词
Emotion recognition; Gender recognition; Speech signal; Fuzzy classifier; Glowworm swarm optimization; GENDER; ALGORITHM; NETWORKS; ROBUST;
D O I
10.1007/s12065-019-00262-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Nowadays, a great attention is focusing on the study of the emotional content in speech signals since the speech signal is one of the quickest and natural tactic to communicate among humans, and thus, many systems have been suggested to recognize the emotional content of a spoken utterance. This paper schemes two contributions: Gender recognition and Emotion recognition. In the first contribution, there has two phases: feature extraction and classification. The pitch feature is extracted and given for classification using k-Nearest Neighboring classifier. In the second contribution, features like Non-Negative Matrix Factorization and pitch are extracted and given as the input to Adaptive Fuzzy classifier to recognize the respective emotions. In addition, the limits of membership functions are optimally chosen using a renowned optimization algorithm namely glowworm swarm optimization (GSO). Thus the proposed adaptive Fuzzy classifier using GSO is termed as GSO-FC. The performance of proposed model is compared to other conventional algorithms like Grey Wolf Optimization, FireFly, Particle Swarm Optimization, Artificial Bee Colony and Genetic Algorithm in correspondence with varied performance measures like Accuracy, Sensitivity, Specificity, Precision, False positive rate, False negative rate, Negative Predictive Value, False Discovery Rate, F-1 Score and Mathews correlation coefficient.
引用
收藏
页码:939 / 953
页数:15
相关论文
共 45 条
[11]  
Glodek M, 2011, LECT NOTES COMPUT SC, V6975, P359, DOI 10.1007/978-3-642-24571-8_47
[12]  
Gonzalez S, 2011, EUR SIGNAL PR CONF, P451
[13]   An Automatic Framework for Textured 3D Video-Based Facial Expression Recognition [J].
Hayat, Munawar ;
Bennamoun, Mohammed .
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2014, 5 (03) :301-313
[14]   Extraction of adaptive wavelet packet filter-bank-based acoustic feature for speech emotion recognition [J].
Huang, Yongming ;
Wu, Ao ;
Zhang, Guobao ;
Li, Yue .
IET SIGNAL PROCESSING, 2015, 9 (04) :341-348
[15]   Noisy speech emotion recognition using sample reconstruction and multiple-kernel learning [J].
Xiaoqing J. ;
Kewen X. ;
Yongliang L. ;
Jianchuan B. .
J. China Univ. Post Telecom., 2 (1,17-9) :1,17-9
[16]   Cultural dependency analysis for understanding speech emotion [J].
Kamaruddin, Norhaslinda ;
Wahab, Abdul ;
Quek, Chai .
EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (05) :5115-5133
[17]  
Karaboga D, 2008, APPL SOFT COMPUT, V8, P687, DOI 10.1016/j.asoc.2007.05.007
[18]   Semantic Pyramids for Gender and Action Recognition [J].
Khan, Fahad Shahbaz ;
van de Weijer, Joost ;
Anwer, Rao Muhammad ;
Felsberg, Michael ;
Gatta, Carlo .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2014, 23 (08) :3633-3645
[19]   Speaker-independent emotion recognition exploiting a psychologically-inspired binary cascade classification schema [J].
Kotti, Margarita ;
Paterno, Fabio .
INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2012, 15 (02) :131-150
[20]  
Lijun Zhang, 2011, Frontiers of Electrical and Electronic Engineering in China, V6, P192, DOI 10.1007/s11460-011-0128-0