Speaker Identification and Verification from Audio Coded Speech in Matched and Mismatched Conditions

被引：6

作者：

Jiang, Tao ^{[1
]}

Gao, Boyang ^{[1
]}

Han, Jiqing ^{[1
]}

机构：

[1] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150006, Peoples R China

来源：

2009 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO 2009), VOLS 1-4 | 2009年

关键词：

MODELS;

D O I：

10.1109/ROBIO.2009.5420478

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We investigate the effect of audio coding on speaker identification and verification when training and testing conditions are matched and mismatched. Experiments use popular audio coding algorithms (Windows Media Audio 9.1, Advanced Audio Coding, MPEG Audio Layer III) and a speaker identification and verification system based on Gaussian mixture models. There is some loss in identification and verification performance for audio coding process without the change of sample rate, and a great loss when sample rate changes during audio coding process.

引用

页码：2199 / 2204

页数：6

共 50 条

[1] Speaker verification using coded speech
Moreno-Daniel, A
Juang, BH
Nolazco-Flores, JA
PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS AND APPLICATIONS, 2004, 3287 : 366 - 373
[2] Speaker verification from coded telephone speech using stochastic feature transformation and handset identification
Yu, EWM
Mak, MW
Kung, SY
ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2002, PROCEEDING, 2002, 2532 : 598 - 606
[3] SPEAKER GENDER IDENTIFICATION IN MATCHED AND MISMATCHED CONDITIONS BASED ON STACKING ENSEMBLE METHOD
Badr, Ameer A.
Abdul-Hassan, Alia K.
JOURNAL OF ENGINEERING SCIENCE AND TECHNOLOGY, 2022, 17 (02): : 1119 - 1134
[4] Speaker verification under mismatched data conditions
Pillay, S. G.
Ariyaeeinia, A.
Pawlewski, M.
Sivakumaran, P.
IET SIGNAL PROCESSING, 2009, 3 (04) : 236 - 246
[5] Speaker Identification from Mixture of Speech and Non-speech Audio Signal
Yasmin, Ghazaala
Dhara, Subrata
Mahindar, Rudrendu
Das, Asit Kumar
SOFT COMPUTING IN DATA ANALYTICS, SCDA 2018, 2019, 758 : 473 - 482
[6] IMPROVING THE PERFORMANCE OF VTLN UNDER MISMATCHED SPEAKER CONDITIONS AND MAKING IT APPROACH THAT OF MATCHED SPEAKER CONDITIONS
Sanand, D. R.
Rath, S. P.
Umesh, S.
2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4397 - 4400
[7] SVM-BASED SPEAKER VERIFICATION FOR CODED AND UNCODED SPEECH
Janicki, Artur
2012 PROCEEDINGS OF THE 20TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2012, : 26 - 30
[8] Variants of Mel-frequency Cepstral Coefficients for Improved Whispered Speech Speaker Verification in Mismatched Conditions
Sarria-Paja, Milton
Falk, Tiago H.
2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2017, : 91 - 95
[9] Assessment of Single-Channel Speech Enhancement Techniques for Speaker Identification under Mismatched Conditions
Sadjadi, Seyed Omid
Hansen, John H. L.
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2138 - 2141
[10] SPEAKER IDENTIFICATION IN LOW-RATE CODED SPEECH
Catellier, Andrew
Voran, Stephen
MEASUREMENT OF SPEECH, AUDIO AND VIDEO QUALITY IN NETWORKS, 2008, : 27 - 36

← 1 2 3 4 5 →