Emotional Speaker Verification Based on I-vectors

Cited by: 0
Authors
Mackova, Lenka [1]
Cizmar, Anton [1]
Affiliations
[1] Tech Univ Kosice, Fac Elect Engn & Informat, Dept Elect & Multimedia Commun, Kosice, Slovakia
Source
2014 5TH IEEE CONFERENCE ON COGNITIVE INFOCOMMUNICATIONS (COGINFOCOM) | 2014
Keywords
speaker recognition; emotions; i-vectors; total variability
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The i-vector approach to speaker verification has recently become very successful and popular. It represents each utterance by a low-dimensional feature vector of fixed length. In this experiment, an emotional speech database was used for speaker recognition. Two concepts of speaker model training were carried out using the i-vector principle. For feature extraction, Mel Frequency Cepstral Coefficients (MFCC) with different numbers of coefficients were used in combination with the log-energy coefficient and the first, second, and third regression coefficients. The Mahalanobis distance metric and the Cosine Distance Scoring (CDS) metric were used for classification in the speaker recognition task. This work also introduces our own emotional database, SUS, of recordings in the Slovak language. Utterances of the male speakers in this corpus were used as input to the verification system.
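The two scoring metrics named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the toy 3-dimensional vectors, the decision threshold, and the diagonal-covariance simplification of the Mahalanobis distance are all assumptions made for illustration. Real i-vectors typically have a few hundred dimensions and a full covariance matrix estimated from training data.

```python
import math
from typing import Sequence


def cosine_score(w_target: Sequence[float], w_test: Sequence[float]) -> float:
    """Cosine similarity between two i-vectors; higher means more similar."""
    dot = sum(a * b for a, b in zip(w_target, w_test))
    norm_target = math.sqrt(sum(a * a for a in w_target))
    norm_test = math.sqrt(sum(b * b for b in w_test))
    return dot / (norm_target * norm_test)


def mahalanobis_diag(
    w_target: Sequence[float],
    w_test: Sequence[float],
    variances: Sequence[float],
) -> float:
    """Mahalanobis distance under an ASSUMED diagonal covariance.

    With a diagonal covariance, each squared difference is simply
    divided by the variance of that dimension. Lower means more similar.
    """
    return math.sqrt(
        sum((a - b) ** 2 / v for a, b, v in zip(w_target, w_test, variances))
    )


def accept(score: float, threshold: float) -> bool:
    """Hypothetical verification decision for a similarity score."""
    return score >= threshold


if __name__ == "__main__":
    # Toy vectors standing in for a speaker model and a test utterance.
    target = [0.2, -0.5, 0.8]
    test = [0.25, -0.4, 0.7]
    variances = [1.0, 1.0, 1.0]  # assumed per-dimension variances

    similarity = cosine_score(target, test)
    distance = mahalanobis_diag(target, test, variances)
    print("cosine score:", similarity)
    print("Mahalanobis distance (diagonal covariance):", distance)
    print("accepted:", accept(similarity, threshold=0.9))  # threshold is illustrative
```

The cosine score depends only on the angle between the two vectors, so it ignores their magnitudes, whereas the Mahalanobis distance weights each dimension by its variance.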
Pages: 533-536
Number of pages: 4
Related papers
50 in total
  • [31] Regional Accents Recognition based on i-vectors approach: The Case of the Algerian linguistic environment
    Djellab, Mourad
    Amrouche, Abderrahmane
    Mehallegue, Noureddine
    Bouridane, Ahmed
    2015 4TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2015, : 166 - U400
  • [32] Probabilistic approach using joint long and short session i-vectors modeling to deal with short utterances for speaker recognition
    Ben Kheder, Waad
    Matrouf, Driss
    Ajili, Moez
    Bonastre, Jean-Francois
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 1830 - 1834
  • [33] HANDLING I-VECTORS FROM DIFFERENT RECORDING CONDITIONS USING MULTI-CHANNEL SIMPLIFIED PLDA IN SPEAKER RECOGNITION
    Villalba, Jesus
    Lleida, Eduardo
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 6763 - 6767
  • [34] Multimodal i-vectors to Detect and Evaluate Parkinson's Disease
    Garcia, N.
    Vasquez-Correa, J. C.
    Orozco-Arroyave, J. R.
    Noth, E.
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2349 - 2353
  • [35] Automatic Evaluation of Speech Intelligibility Based on i-vectors in the Context of Head and Neck Cancers
    Laaridh, Imed
    Fredouille, Corinne
    Ghio, Alain
    Lalain, Muriel
    Woisard, Virginie
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2943 - 2947
  • [36] Audio-Visual Speech Separation Using I-Vectors
    Luo, Yiyu
    Wang, Jing
    Wang, Xinyao
    Wen, Liang
    Wang, Lizhong
    2019 2ND IEEE INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND SIGNAL PROCESSING (ICICSP), 2019, : 276 - 280
  • [37] TIMBRAL MODELING FOR MUSIC ARTIST RECOGNITION USING I-VECTORS
    Eghbal-zadeh, Hamid
    Schedl, Markus
    Widmer, Gerhard
    2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 1286 - 1290
  • [38] Multisource I-Vectors Domain Adaptation Using Maximum Mean Discrepancy Based Autoencoders
    Lin, Wei-wei
    Mak, Man-Wai
    Chien, Jen-Tzung
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (12) : 2412 - 2422
  • [39] Unleashing the Unused Potential of I-Vectors Enabled by GPU Acceleration
    Vestman, Ville
    Lee, Kong Aik
    Kinnunen, Tomi H.
    Koshinaka, Takafumi
    INTERSPEECH 2019, 2019, : 351 - 355
  • [40] Incorporation of discriminative n-grams to improve a phonotactic language recognizer based on i-vectors
    Salamea Palaciosi, Christian
    Fernando D'Haro, Luis
    Cordoba, Ricardo
    Angel Caraballo, Miguel
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2013, (51): : 145 - 152