Speaker Identification and Verification from Audio Coded Speech in Matched and Mismatched Conditions

Cited by: 6
Authors
Jiang, Tao [1]
Gao, Boyang [1]
Han, Jiqing [1]
Affiliations
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150006, Peoples R China
Keywords
MODELS;
DOI
10.1109/ROBIO.2009.5420478
Chinese Library Classification
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We investigate the effect of audio coding on speaker identification and verification when training and testing conditions are matched and mismatched. Experiments use popular audio coding algorithms (Windows Media Audio 9.1, Advanced Audio Coding, MPEG Audio Layer III) and a speaker identification and verification system based on Gaussian mixture models. Identification and verification performance degrades somewhat when the audio coding process leaves the sample rate unchanged, and degrades greatly when the sample rate changes during coding.
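As an illustration of the kind of system the abstract describes, the following is a minimal sketch of Gaussian-mixture-model scoring for closed-set identification and for verification. It is not the authors' implementation: the model parameters, the speaker names (`spk_a`, `spk_b`), the background model, the zero threshold, and the two-dimensional "features" are all made-up assumptions, and feature extraction, model training, and audio coding are omitted. In a mismatched condition, the test frames would come from coded audio while the models were trained on uncoded audio, or the reverse.

```python
import math

def log_gauss_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at feature vector x."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )

def gmm_avg_loglik(frames, gmm):
    """Average per-frame log-likelihood under a GMM.

    gmm is a list of (weight, mean, var) components.
    """
    total = 0.0
    for x in frames:
        comp = [math.log(w) + log_gauss_diag(x, m, v) for w, m, v in gmm]
        peak = max(comp)  # log-sum-exp for numerical stability
        total += peak + math.log(sum(math.exp(c - peak) for c in comp))
    return total / len(frames)

def identify(frames, speaker_models):
    """Closed-set identification: return the speaker whose GMM scores highest."""
    return max(speaker_models,
               key=lambda s: gmm_avg_loglik(frames, speaker_models[s]))

def verify(frames, claimed_model, background_model, threshold=0.0):
    """Verification: accept if the log-likelihood ratio exceeds the threshold."""
    llr = (gmm_avg_loglik(frames, claimed_model)
           - gmm_avg_loglik(frames, background_model))
    return llr > threshold, llr

# Hypothetical toy models (assumed values, not taken from the paper).
speaker_models = {
    "spk_a": [(0.5, [0.0, 0.0], [1.0, 1.0]), (0.5, [1.0, 1.0], [1.0, 1.0])],
    "spk_b": [(0.5, [4.0, 4.0], [1.0, 1.0]), (0.5, [5.0, 5.0], [1.0, 1.0])],
}
background = [(0.5, [0.5, 0.5], [9.0, 9.0]), (0.5, [4.5, 4.5], [9.0, 9.0])]

# Made-up test frames lying close to spk_a's components.
test_frames = [[0.2, -0.1], [0.9, 1.1], [0.4, 0.6]]

best = identify(test_frames, speaker_models)
accept_a, llr_a = verify(test_frames, speaker_models["spk_a"], background)
accept_b, llr_b = verify(test_frames, speaker_models["spk_b"], background)
```

Identification picks the best-scoring enrolled speaker, while verification compares a single claimed speaker against a background model, so the two tasks can respond differently to the same coding distortion.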
Pages: 2199-2204
Page count: 6
Related papers
50 in total
  • [1] Speaker verification using coded speech
    Moreno-Daniel, A
    Juang, BH
    Nolazco-Flores, JA
    PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS AND APPLICATIONS, 2004, 3287 : 366 - 373
  • [2] Speaker verification from coded telephone speech using stochastic feature transformation and handset identification
    Yu, EWM
    Mak, MW
    Kung, SY
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2002, PROCEEDING, 2002, 2532 : 598 - 606
  • [3] SPEAKER GENDER IDENTIFICATION IN MATCHED AND MISMATCHED CONDITIONS BASED ON STACKING ENSEMBLE METHOD
    Badr, Ameer A.
    Abdul-Hassan, Alia K.
    JOURNAL OF ENGINEERING SCIENCE AND TECHNOLOGY, 2022, 17 (02) : 1119 - 1134
  • [4] Speaker verification under mismatched data conditions
    Pillay, S. G.
    Ariyaeeinia, A.
    Pawlewski, M.
    Sivakumaran, P.
    IET SIGNAL PROCESSING, 2009, 3 (04) : 236 - 246
  • [5] Speaker Identification from Mixture of Speech and Non-speech Audio Signal
    Yasmin, Ghazaala
    Dhara, Subrata
    Mahindar, Rudrendu
    Das, Asit Kumar
    SOFT COMPUTING IN DATA ANALYTICS, SCDA 2018, 2019, 758 : 473 - 482
  • [6] IMPROVING THE PERFORMANCE OF VTLN UNDER MISMATCHED SPEAKER CONDITIONS AND MAKING IT APPROACH THAT OF MATCHED SPEAKER CONDITIONS
    Sanand, D. R.
    Rath, S. P.
    Umesh, S.
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-8, PROCEEDINGS, 2009, : 4397 - 4400
  • [7] SVM-BASED SPEAKER VERIFICATION FOR CODED AND UNCODED SPEECH
    Janicki, Artur
    2012 PROCEEDINGS OF THE 20TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2012, : 26 - 30
  • [8] Variants of Mel-frequency Cepstral Coefficients for Improved Whispered Speech Speaker Verification in Mismatched Conditions
    Sarria-Paja, Milton
    Falk, Tiago H.
    2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2017, : 91 - 95
  • [9] Assessment of Single-Channel Speech Enhancement Techniques for Speaker Identification under Mismatched Conditions
    Sadjadi, Seyed Omid
    Hansen, John H. L.
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2138 - 2141
  • [10] SPEAKER IDENTIFICATION IN LOW-RATE CODED SPEECH
    Catellier, Andrew
    Voran, Stephen
    MEASUREMENT OF SPEECH, AUDIO AND VIDEO QUALITY IN NETWORKS, 2008, : 27 - 36