Language-independent hyperparameter optimization based speech emotion recognition system

被引:7
|
作者
Thakur A. [1 ]
Dhull S.K. [1 ]
机构
[1] Guru Jambheshwar University of Science and Technology, Hisar
关键词
Feature extraction; Hyperparameter optimization; Machine learning; Speech emotion recognition;
D O I
10.1007/s41870-022-00996-9
中图分类号
学科分类号
摘要
Speech emotion recognition is challenging due to substantially overlapping regions of emotions. Extracting desired features that influence emotions in a speech and categorizing these emotions is a tedious task. We intend to develop an effective and robust speech emotion recognition system capable of classifying ambiguous and overlapping emotions through this manuscript. Three feature sets Spectral, Prosodic, and Discrete Wavelet Transform are extracted and further processed to reduce the required combination of features. The use of hyper-parameter optimization in the machine learning model has been done to tune the support vector machine classifier parameter for the Speech emotion recognition system. The suggested model is also verified with two different language datasets: ‘SAVEE’ and ‘EmoDB’ resulting in a language-independent emotion recognition system from speech. The performance result achieved by employing the proposed technique in EmoDB with 535 samples and SAVEE with 480 samples in seven different emotion types is 90.02% and 71.66%, respectively. © 2022, The Author(s), under exclusive licence to Bharati Vidyapeeth's Institute of Computer Applications and Management.
引用
收藏
页码:3691 / 3699
页数:8
相关论文
共 50 条
  • [1] Language-independent computer emotion recognition
    Mitsuyoshi, S
    Ren, FJ
    Proceedings of the Ninth IASTED International Conference on Artificial Intelligence and Soft Computing, 2005, : 417 - 422
  • [2] Multiclass SVM-based Language-Independent Emotion Recognition using Selective Speech Features
    Amol, Kokane T.
    Guddeti, Ram Mohana Reddy
    2014 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2014, : 1069 - 1073
  • [3] Investigation of speech-based language-independent possibilities of depression recognition
    Kiss, Gabor
    2022 45TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING, TSP, 2022, : 226 - 229
  • [4] Domain Generalization for Language-Independent Automatic Speech Recognition
    Gao, Heting
    Ni, Junrui
    Zhang, Yang
    Qian, Kaizhi
    Chang, Shiyu
    Hasegawa-Johnson, Mark
    FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2022, 5
  • [5] Language-independent and language-adaptive acoustic modeling for speech recognition
    Schultz, T
    Waibel, A
    SPEECH COMMUNICATION, 2001, 35 (1-2) : 31 - 51
  • [6] An Efficient Language-Independent Acoustic Emotion Classification System
    Rajwinder Singh
    Harshita Puri
    Naveen Aggarwal
    Varun Gupta
    Arabian Journal for Science and Engineering, 2020, 45 : 3111 - 3121
  • [7] An Efficient Language-Independent Acoustic Emotion Classification System
    Singh, Rajwinder
    Puri, Harshita
    Aggarwal, Naveen
    Gupta, Varun
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2020, 45 (04) : 3111 - 3121
  • [8] Speaker-and language-independent speech recognition in mobile communication systems
    Viikki, I
    Kiss, I
    Tian, J
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 5 - 8
  • [9] CONFIDENCE INDEX DYNAMIC TIME WARPING FOR LANGUAGE-INDEPENDENT EMBEDDED SPEECH RECOGNITION
    Zhang, Xianglilan
    Sun, Jiping
    Luo, Zhigang
    Li, Ming
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 8066 - 8070
  • [10] Speaker based Language Independent Isolated Speech Recognition System
    Therese, Shanthi S.
    Lingam, Chelpa
    2015 INTERNATIONAL CONFERENCE ON COMMUNICATION, INFORMATION & COMPUTING TECHNOLOGY (ICCICT), 2015,