Speaker Identification Enhancement Using Emotional Features

被引:0
|
作者
Jabnoun, Jihed [1 ]
Zrigui, Ahmed [2 ]
Slimi, Anwer [1 ]
Ringeval, Fabien [3 ]
Schwab, Didier [3 ]
Zrigui, Mounir [1 ]
机构
[1] Univ Monastir, Res Lab Algebra, Numbers Theory & Intelligent Syst, Monastir, Tunisia
[2] Univ Bordeaux, LaBRI Lab, Nouvelle Aquitaine, France
[3] Univ Grenoble Alpes, CNRS Grenoble, Grenoble, France
关键词
SR; Speaker Segmentation; Triplet Loss; Emotion Recognition; CNN; Bi-LSTM; DIARIZATION;
D O I
10.1007/978-3-031-41456-5_40
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Speaker recognition is a broad field that encompasses many different tasks related to identifying speakers in audio recordings. Two specific sub-tasks that are often studied are speaker segmentation and speaker identification. These tasks typically involve analyzing acoustic features of the audio to determine who is speaking. However, one limitation of traditional speaker identification methods is that they can struggle when dealing with emotional conversations, as the acoustic features can change due to the emotions being expressed. To address this limitation, focuses on studying the effect of emotion on speaker identification by combining features of both the emotions and speakers. This approach has shown to improve identification accuracy, increasing it from 72% using speaker features alone to 75% when both emotion and speaker features are used.
引用
收藏
页码:526 / 539
页数:14
相关论文
共 50 条
  • [1] Performance enhancement of speaker identification systems using speech encryption and cancelable features
    Soliman N.F.
    Mostfa Z.
    El-Samie F.E.A.
    Abdalla M.I.
    Soliman, Naglaa F. (nagla_soliman@yahoo.com), 1600, Springer Science and Business Media, LLC (20): : 977 - 1004
  • [2] Speaker Modeling Using Emotional Speech for More Robust Speaker Identification
    M. Milošević
    Ž. Nedeljković
    U. Glavitsch
    Ž. Đurović
    Journal of Communications Technology and Electronics, 2019, 64 : 1256 - 1265
  • [3] Speaker Modeling Using Emotional Speech for More Robust Speaker Identification
    Milosevic, M.
    Nedeljkovic, Z.
    Glavitsch, U.
    Durovic, Z.
    JOURNAL OF COMMUNICATIONS TECHNOLOGY AND ELECTRONICS, 2019, 64 (11) : 1256 - 1265
  • [4] Speaker identification using speech and lip features
    Ou, GB
    Li, X
    Yao, XC
    Jia, HB
    Murphey, YL
    PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), VOLS 1-5, 2005, : 2565 - 2570
  • [5] Speaker identification using nonlinear dynamical features
    Petry, A
    Barone, DAC
    CHAOS SOLITONS & FRACTALS, 2002, 13 (02) : 221 - 231
  • [6] Speaker identification using orthogonal and discriminative features
    Davarpanah, SH
    Mirzaei, A
    Ziaei, A
    IWSSIP 2005: Proceedings of the 12th International Worshop on Systems, Signals & Image Processing, 2005, : 293 - 296
  • [7] Speaker Identification Using HHT Spectrum Features
    Liu, Jia-Wei
    Wang, Jia-Ching
    Lin, Chang-Hong
    2011 INTERNATIONAL CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI 2011), 2011, : 145 - 148
  • [8] Speaker identification in emotional environments
    Department of Electrical and Computer Engineering, University of Sharjah, Sharjah, United Arab Emirates
    Iran. J. Electr. Comput. Eng., 2009, 1 (41-46):
  • [9] Emotional speaker identification using a novel capsule nets model
    Nassif, Ali Bou
    Shahin, Ismail
    Elnagar, Ashraf
    Velayudhan, Divya
    Alhudhaif, Adi
    Polat, Kemal
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 193
  • [10] Speaker Identification using Selected Features from DWT
    AbuAladas, Feras E.
    Samawi, Venus W.
    2013 International Conference on Advanced Computer Science Applications and Technologies (ACSAT), 2014, : 114 - 119