Speaker verification using coded speech

被引:0
|
作者
Moreno-Daniel, A [1 ]
Juang, BH
Nolazco-Flores, JA
机构
[1] Georgia Inst Technol, Ctr Signal & Image Proc, Atlanta, GA 30332 USA
[2] Inst Tecnol & Estudios Super Monterrey, Dept Ciencias Computacionales, Monterrey, NL, Mexico
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The implementation of a pseudo text-independent Speaker Verification system is described. This system was designed to use only information extracted directly from the coded parameters embedded in the ITU-T G.729 bit-stream. Experiments were performed over the YOHO database [1]. The feature vector as a short-time representation of speech consists of 16 LPC-Cepstral coefficients, as well as residual information appended in the form of a pitch estimate and a measure of vocality of the speech. The robustness in verification accuracy is also studied. The results show that while speech coders, G.729 in particular, introduce coding distortions that lead to verification performance degradation, proper augmented use of unconventional information nevertheless leads to a competitive performance on par with that of a well-studied traditional system which does not involve signal coding and transmission. The result suggests that speaker verification over a cell phone connection remains feasible even though the signal has been encoded to 8 Kb/s.
引用
收藏
页码:366 / 373
页数:8
相关论文
共 50 条
  • [21] PLDA Speaker Verification with Limited Speech Data
    Ridzik, Andrej
    Rusko, Milan
    SPEECH AND COMPUTER (SPECOM 2015), 2015, 9319 : 325 - 332
  • [22] Speech Enhancement Regularized by a Speaker Verification Model
    Lay, Bunlong
    Gerkmann, Timo
    2022 IEEE 24TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2022,
  • [23] A New Speech Corpus in Spanish for Speaker Verification
    Garcia, N.
    Arias-Vergara, T.
    Orozco-Arroyave, J. R.
    Vargas-Bonilla, J. F.
    2016 XXI SYMPOSIUM ON SIGNAL PROCESSING, IMAGES AND ARTIFICIAL VISION (STSIVA), 2016,
  • [24] SCHEME FOR SPEECH PROCESSING IN AUTOMATIC SPEAKER VERIFICATION
    DAS, SK
    MOHN, WS
    IEEE TRANSACTIONS ON AUDIO AND ELECTROACOUSTICS, 1971, AU19 (01): : 32 - &
  • [25] VoiceID Loss: Speech Enhancement for Speaker Verification
    Shon, Suwon
    Tang, Hao
    Glass, James
    INTERSPEECH 2019, 2019, : 2888 - 2892
  • [26] Using Phoneme Recognition and Text-dependent Speaker Verification to Improve Speaker Segmentation for Chinese Speech
    Wang, Gang
    Wu, Xiaojun
    Zheng, Thomas Fang
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 1457 - 1460
  • [27] TOWARDS DIRECTLY MODELING RAW SPEECH SIGNAL FOR SPEAKER VERIFICATION USING CNNS
    Muckenhirn, Hannah
    Magimai-Doss, Mathew
    Marcel, Sebastien
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 4884 - 4888
  • [28] REVISITING THE SECURITY OF SPEAKER VERIFICATION SYSTEMS AGAINST IMPOSTURE USING SYNTHETIC SPEECH
    De Leon, Phillip L.
    Apsingekar, Vijendra Raj
    Pucher, Michael
    Yamagishi, Junichi
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 1798 - 1801
  • [29] Structure of pauses in speech in the context of speaker verification and classification of speech type
    Igras-Cybulska, Magdalena
    Ziolko, Bartosz
    Zelasko, Piotr
    Witkowski, Marcin
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2016,
  • [30] Structure of pauses in speech in the context of speaker verification and classification of speech type
    Magdalena Igras-Cybulska
    Bartosz Ziółko
    Piotr Żelasko
    Marcin Witkowski
    EURASIP Journal on Audio, Speech, and Music Processing, 2016