Robust Acoustic Emotion Recognition Based on Cascaded Normalization and Extreme Learning Machines

被引:17
作者
Kaya, Heysem [1 ]
Karpov, Alexey A. [2 ,3 ]
Salah, Albert Ali [4 ]
机构
[1] Namik Kemal Univ, Corlu Fac Engn, Dept Comp Engn, Corlu, Tekirdag, Turkey
[2] Russian Acad Sci, St Petersburg Inst Informat & Automat, St Petersburg, Russia
[3] ITMO Univ, St Petersburg, Russia
[4] Bogazici Univ, Dept Comp Engn, Istanbul, Turkey
来源
ADVANCES IN NEURAL NETWORKS - ISNN 2016 | 2016年 / 9719卷
关键词
Acoustic emotion recognition; Speech emotion recognition; Cascaded normalization; Extreme learning machines; ELM; COGNITIVE LOAD;
D O I
10.1007/978-3-319-40663-3_14
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One of the challenges in speech emotion recognition is robust and speaker-independent emotion recognition. In this paper, we take a cascaded normalization approach, combining linear speaker level, non-linear value level and feature vector level normalization to minimize speaker-related effects and to maximize class separability with linear kernel classifiers. We use extreme learning machine classifiers on a four class (i.e. joy, anger, sadness, neutral) problem. We show the efficacy of our proposed method on the recently collected Turkish Emotional Speech Database.
引用
收藏
页码:115 / 123
页数:9
相关论文
共 20 条
[1]  
[Anonymous], 1971, Generalized Inverses of Matrices and its Applications
[2]  
[Anonymous], INTERSPEECH
[3]  
[Anonymous], 17 NAT C TURK LING
[4]  
Cowie R, 2011, COGN TECHNOL, P9, DOI 10.1007/978-3-642-15184-2_2
[5]   Front-End Factor Analysis for Speaker Verification [J].
Dehak, Najim ;
Kenny, Patrick J. ;
Dehak, Reda ;
Dumouchel, Pierre ;
Ouellet, Pierre .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (04) :788-798
[6]  
Eyben F., 2010, P 18 ACM INT C MULT, P1459
[7]  
Huang GB, 2004, IEEE IJCNN, P985
[8]   Extreme learning machine: Theory and applications [J].
Huang, Guang-Bin ;
Zhu, Qin-Yu ;
Siew, Chee-Kheong .
NEUROCOMPUTING, 2006, 70 (1-3) :489-501
[9]   Extreme Learning Machine for Regression and Multiclass Classification [J].
Huang, Guang-Bin ;
Zhou, Hongming ;
Ding, Xiaojian ;
Zhang, Rui .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2012, 42 (02) :513-529
[10]   Contrasting and Combining Least Squares Based Learners for Emotion Recognition in the Wild [J].
Kaya, Heysem ;
Gurpinar, Furkan ;
Afshar, Sadaf ;
Salah, Albert Ali .
ICMI'15: PROCEEDINGS OF THE 2015 ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2015, :459-466