Speaker verification using coded speech

Cited by: 0

Authors:

Moreno-Daniel, A [1]

Juang, BH

Nolazco-Flores, JA

Affiliations:

[1] Georgia Inst Technol, Ctr Signal & Image Proc, Atlanta, GA 30332 USA

[2] Inst Tecnol & Estudios Super Monterrey, Dept Ciencias Computacionales, Monterrey, NL, Mexico

Source:

PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS AND APPLICATIONS | 2004 / Volume 3287

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

The implementation of a pseudo text-independent speaker verification system is described. The system was designed to use only information extracted directly from the coded parameters embedded in the ITU-T G.729 bit-stream. Experiments were performed on the YOHO database [1]. The feature vector, a short-time representation of speech, consists of 16 LPC-cepstral coefficients, with residual information appended in the form of a pitch estimate and a measure of vocality of the speech. The robustness of verification accuracy is also studied. The results show that although speech coders, and G.729 in particular, introduce coding distortions that degrade verification performance, properly augmenting the features with this unconventional information yields performance on par with that of a well-studied traditional system that involves no signal coding or transmission. The results suggest that speaker verification over a cell phone connection remains feasible even when the signal has been encoded at 8 kb/s.
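The feature vector described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the decoder has already produced a set of LPC predictor coefficients, a pitch estimate, and a voicing measure for each frame, and it uses the standard LPC-to-cepstrum recursion. The function names, the coefficient sign convention, and the use of a 10th-order predictor in the usage note are assumptions made for illustration.

```python
def lpc_to_cepstrum(a, n_ceps):
    """Convert LPC predictor coefficients to LPC-cepstral coefficients.

    Assumed convention: the all-pole model is 1 / (1 - sum_k a[k-1] * z**-k),
    so a[0] holds a_1, a[1] holds a_2, and so on. Returns c_1 .. c_{n_ceps}.
    """
    p = len(a)
    c = [0.0] * (n_ceps + 1)  # c[0] is unused (gain term is ignored here)
    for n in range(1, n_ceps + 1):
        # Direct term a_n exists only while n does not exceed the LPC order.
        acc = a[n - 1] if n <= p else 0.0
        # Recursive term: sum over k of (k/n) * c_k * a_{n-k}, with n-k <= p.
        for k in range(max(1, n - p), n):
            acc += (k / n) * c[k] * a[n - k - 1]
        c[n] = acc
    return c[1:]


def build_feature_vector(lpc, pitch, voicing, n_ceps=16):
    """Hypothetical helper: 16 LPC-cepstral coefficients plus pitch and voicing."""
    return lpc_to_cepstrum(lpc, n_ceps) + [pitch, voicing]
```

Under these assumptions, calling `build_feature_vector` with a 10-coefficient LPC list, a pitch value, and a voicing value returns an 18-dimensional vector per frame. For a single-pole model with coefficient `a`, the recursion reproduces the known closed form `c_n = a**n / n`, which is a convenient sanity check.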

Citation

Pages: 366-373

Page count: 8
