Emotional Speaker Verification Based on I-vectors

Cited by: 0

Authors:

Mackova, Lenka [1]

Cizmar, Anton [1]

Affiliation:

[1] Tech Univ Kosice, Fac Elect Engn & Informat, Dept Elect & Multimedia Commun, Kosice, Slovakia

Source:

2014 5TH IEEE CONFERENCE ON COGNITIVE INFOCOMMUNICATIONS (COGINFOCOM) | 2014

Keywords:

speaker recognition; emotions; i-vectors; total variability;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Recently, the i-vector approach has become very successful and popular in speaker verification. The i-vector principle represents each utterance by a low-dimensional feature vector of fixed length. In this experiment, an emotional speech database was used for speaker recognition. Using the i-vector principle, two concepts of speaker model training were carried out. For feature extraction, Mel Frequency Cepstral Coefficients (MFCC) with different numbers of coefficients were used in combination with the log-energy coefficient and the first, second, and third regression coefficients. The Mahalanobis distance metric and Cosine Distance Scoring (CDS) were used for classification. This work also introduces our own emotional database of recordings in the Slovak language, SUS. Utterances of the male speakers in this corpus were used as input to the verification system.
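The two scoring metrics named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes i-vectors are already extracted and held as plain Python lists, it simplifies the Mahalanobis distance to a diagonal covariance (the abstract does not say how the covariance was estimated), and the function names and the 0.5 decision threshold are hypothetical.

```python
import math


def cosine_score(w_target, w_test):
    """Cosine similarity between a target-speaker i-vector and a test i-vector."""
    dot = sum(a * b for a, b in zip(w_target, w_test))
    norm_target = math.sqrt(sum(a * a for a in w_target))
    norm_test = math.sqrt(sum(b * b for b in w_test))
    return dot / (norm_target * norm_test)


def mahalanobis_diag(w_target, w_test, variances):
    """Mahalanobis distance under an assumed diagonal covariance.

    `variances` holds one variance per i-vector dimension (illustrative
    simplification; a full covariance matrix would need a matrix inverse).
    """
    return math.sqrt(
        sum((a - b) ** 2 / v for a, b, v in zip(w_target, w_test, variances))
    )


def accept(w_target, w_test, threshold=0.5):
    """Accept the claimed identity if the cosine score reaches the threshold.

    The threshold value is a placeholder, not a value from the paper.
    """
    return cosine_score(w_target, w_test) >= threshold
```

A higher cosine score means the two i-vectors point in a more similar direction, whereas a smaller Mahalanobis distance means they are closer, so the two metrics require opposite comparison directions when a decision threshold is applied.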

Pages: 533-536

Page count: 4
