Robust model for speaker verification against session-dependent utterance variation

Cited by: 0

Authors:

Matsui, T [1]

Aikawa, K

Affiliations:

[1] Inst Stat Math, Tokyo 1068569, Japan

[2] NTT Corp, NTT Commun Sci Labs, Tokyo 1008116, Japan

Source:

IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS | 2003 / Vol. E86D / No. 04

Keywords:

speaker verification; speaker model; session dependent; utterance variation; handset dependent distortion

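DOI:

Not available

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract: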

This paper investigates a new method for creating robust speaker models that cope with the inter-session variation of a speaker in a continuous HMM-based speaker verification system. The new method estimates session-independent parameters by decomposing inter-session variation into two distinct parts: session-dependent and session-independent. The parameters of the speaker models are estimated using the speaker adaptive training algorithm in conjunction with equalization of the session-dependent variation. The resulting models capture session-independent speaker characteristics more reliably than conventional models, and their discriminative power improves accordingly. Moreover, we have made our models more invariant to handset variations in a public switched telephone network (PSTN) by treating session-dependent variation and handset-dependent distortion separately. Text-independent speech data recorded by 20 speakers in seven sessions over 16 months was used to evaluate the new approach. The proposed method yields a 15% relative reduction in error rate. Compared with the popular cepstral mean normalization, it yields a 24% relative reduction in error rate when the speaker models are recreated using speech data recorded in four or more sessions.
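For orientation, the baseline named in the abstract, cepstral mean normalization, can be sketched in a few lines. This is a minimal illustration of the standard per-utterance form of that technique, not the paper's proposed method. The function names, the toy feature values, and the error rates used to illustrate a "relative" reduction are all hypothetical and are not taken from the paper.

```python
from typing import List


def cepstral_mean_normalize(frames: List[List[float]]) -> List[List[float]]:
    """Subtract the per-utterance mean of each cepstral coefficient.

    `frames` is a list of feature vectors (one per frame). A constant
    additive offset in the cepstral domain, such as a fixed channel or
    handset effect, is removed by this subtraction.
    """
    if not frames:
        return []
    n_frames = len(frames)
    n_coeffs = len(frames[0])
    means = [sum(f[k] for f in frames) / n_frames for k in range(n_coeffs)]
    return [[f[k] - means[k] for k in range(n_coeffs)] for f in frames]


def relative_reduction(baseline_error: float, new_error: float) -> float:
    """Relative error reduction: the drop as a fraction of the baseline."""
    return (baseline_error - new_error) / baseline_error


# Hypothetical toy utterance (2 frames, 2 coefficients), not real data.
clean = [[1.0, 2.0], [3.0, 6.0]]
# The same utterance with a constant per-coefficient offset added.
offset = [0.5, -1.0]
shifted = [[c + o for c, o in zip(frame, offset)] for frame in clean]

norm_clean = cepstral_mean_normalize(clean)
norm_shifted = cepstral_mean_normalize(shifted)

# Hypothetical error rates, chosen only to illustrate the arithmetic.
example_reduction = relative_reduction(10.0, 8.5)
```

In this toy case the normalized features of `clean` and `shifted` coincide, which is why the technique suppresses a fixed channel offset. Variation that changes from session to session in other ways is not removed by mean subtraction alone, which is the gap the abstract's session-dependent modeling addresses.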

Pages: 712-718

Page count: 7

