Glowworm swarm based fuzzy classifier with dual features for speech emotion recognition

被引:1
作者
Rajasekhar, B. [1 ]
Kamaraju, M. [2 ]
Sumalatha, V [1 ]
机构
[1] Jawaharlal Nehru Technol Univ Ananthapur, Ananthapuramu, Andhra Pradesh, India
[2] Gudlavalleru Engn Coll, Gudlavalleru, Andhra Pradesh, India
关键词
Emotion recognition; Gender recognition; Speech signal; Fuzzy classifier; Glowworm swarm optimization; GENDER; ALGORITHM; NETWORKS; ROBUST;
D O I
10.1007/s12065-019-00262-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Nowadays, a great attention is focusing on the study of the emotional content in speech signals since the speech signal is one of the quickest and natural tactic to communicate among humans, and thus, many systems have been suggested to recognize the emotional content of a spoken utterance. This paper schemes two contributions: Gender recognition and Emotion recognition. In the first contribution, there has two phases: feature extraction and classification. The pitch feature is extracted and given for classification using k-Nearest Neighboring classifier. In the second contribution, features like Non-Negative Matrix Factorization and pitch are extracted and given as the input to Adaptive Fuzzy classifier to recognize the respective emotions. In addition, the limits of membership functions are optimally chosen using a renowned optimization algorithm namely glowworm swarm optimization (GSO). Thus the proposed adaptive Fuzzy classifier using GSO is termed as GSO-FC. The performance of proposed model is compared to other conventional algorithms like Grey Wolf Optimization, FireFly, Particle Swarm Optimization, Artificial Bee Colony and Genetic Algorithm in correspondence with varied performance measures like Accuracy, Sensitivity, Specificity, Precision, False positive rate, False negative rate, Negative Predictive Value, False Discovery Rate, F-1 Score and Mathews correlation coefficient.
引用
收藏
页码:939 / 953
页数:15
相关论文
共 45 条
[1]   Anchor Models for Emotion Recognition from Speech [J].
Attabi, Yazid ;
Dumouchel, Pierre .
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2013, 4 (03) :280-290
[2]   Fusion of Domain-Specific and Trainable Features for Gender Recognition From Face Images [J].
Azzopardi, George ;
Greco, Antonio ;
Saggese, Alessia ;
Vento, Mario .
IEEE ACCESS, 2018, 6 :24171-24183
[3]  
Basu S., 2017, P 2 INT C COMM EL SY, P333, DOI 10.1109/CESYS.2017.8321292
[4]   Semisupervised Autoencoders for Speech Emotion Recognition [J].
Deng, Jun ;
Xu, Xinzhou ;
Zhang, Zixing ;
Fruehholz, Sascha ;
Schuller, Bjorn .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (01) :31-43
[5]   Universum Autoencoder-Based Domain Adaptation for Speech Emotion Recognition [J].
Deng, Jun ;
Xu, Xinzhou ;
Zhang, Zixing ;
Fruhholz, Sascha ;
Schuller, Bjorn .
IEEE SIGNAL PROCESSING LETTERS, 2017, 24 (04) :500-504
[6]   Exploitation of Phase-Based Features for Whispered Speech Emotion Recognition [J].
Deng, Jun ;
Xu, Xinzhou ;
Zhang, Zixing ;
Fruehholz, Sascha ;
Schuller, Bjoern .
IEEE ACCESS, 2016, 4 :4299-4309
[7]   Autoencoder-based Unsupervised Domain Adaptation for Speech Emotion Recognition [J].
Deng, Jun ;
Zhang, Zixing ;
Eyben, Florian ;
Schuller, Bjoern .
IEEE SIGNAL PROCESSING LETTERS, 2014, 21 (09) :1068-1072
[8]   Survey on speech emotion recognition: Features, classification schemes, and databases [J].
El Ayadi, Moataz ;
Kamel, Mohamed S. ;
Karray, Fakhri .
PATTERN RECOGNITION, 2011, 44 (03) :572-587
[9]   Firefly algorithm with chaos [J].
Gandomi, A. H. ;
Yang, X-S. ;
Talatahari, S. ;
Alavi, A. H. .
COMMUNICATIONS IN NONLINEAR SCIENCE AND NUMERICAL SIMULATION, 2013, 18 (01) :89-98
[10]   Multiview Supervised Dictionary Learning in Speech Emotion Recognition [J].
Gangeh, Mehrdad J. ;
Fewzee, Pouria ;
Ghodsi, Ali ;
Kamel, Mohamed S. ;
Karray, Fakhri .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (06) :1056-1068