Robust model for speaker verification against session-dependent utterance variation

Authors
Matsui, T [1]
Aikawa, K
Affiliations
[1] Inst Stat Math, Tokyo 1068569, Japan
[2] NTT Corp, NTT Commun Sci Labs, Tokyo 1008116, Japan
Source
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS | 2003 / Vol. E86D / No. 04
Keywords
speaker verification; speaker model; session dependent; utterance variation; handset dependent distortion
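DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract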
This paper investigates a new method for creating robust speaker models that cope with a speaker's inter-session variation in a continuous HMM-based speaker verification system. The method estimates session-independent parameters by decomposing inter-session variation into two distinct parts: session-dependent and session-independent. The parameters of the speaker models are estimated using the speaker adaptive training algorithm in conjunction with equalization of the session-dependent variation. The resultant models capture session-independent speaker characteristics more reliably than conventional models, and their discriminative power improves accordingly. Moreover, we have made our models more invariant to handset variations in a public switched telephone network (PSTN) by focusing on session-dependent variation and handset-dependent distortion separately. Text-independent speech data recorded from 20 speakers in seven sessions over 16 months were used to evaluate the new approach. The proposed method achieves a 15% relative reduction in error rate. Compared with the popular cepstral mean normalization, the relative error-rate reduction is 24% when the speaker models are recreated using speech data recorded in four or more sessions.
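The abstract names cepstral mean normalization (CMN) as the comparison baseline but does not describe it. As a rough illustration only, the following is a minimal sketch of the textbook per-utterance form of CMN, not the method proposed in the paper. The function name `cepstral_mean_normalize` and the toy feature values are hypothetical, chosen for this example. It shows why CMN removes a constant channel or handset offset in the cepstral domain: subtracting the utterance mean cancels any term added equally to every frame.

```python
def cepstral_mean_normalize(frames):
    """Subtract the per-utterance mean vector from every cepstral frame.

    `frames` is a list of equal-length lists of floats (one list per frame).
    This is the standard per-utterance CMN, shown as an assumption-laden
    sketch; it is not the session-equalization method of the paper.
    """
    if not frames:
        return []
    n = len(frames)
    dim = len(frames[0])
    # Mean of each cepstral coefficient over the whole utterance.
    mean = [sum(f[d] for f in frames) / n for d in range(dim)]
    return [[f[d] - mean[d] for d in range(dim)] for f in frames]


# Toy data (invented): the same utterance with and without a constant
# additive offset, which is how a fixed channel appears in the cepstrum.
clean = [[1.0, -2.0], [3.0, 0.0], [-4.0, 2.0]]
offset = [0.5, -1.5]
shifted = [[c + o for c, o in zip(frame, offset)] for frame in clean]

norm_clean = cepstral_mean_normalize(clean)
norm_shifted = cepstral_mean_normalize(shifted)

# After CMN the two versions agree up to floating-point rounding.
max_diff = max(
    abs(a - b)
    for fa, fb in zip(norm_clean, norm_shifted)
    for a, b in zip(fa, fb)
)
print(max_diff < 1e-9)
```

Because CMN removes only a per-utterance constant, it cannot by itself separate session-dependent variation from speaker characteristics, which is the gap the abstract says the proposed models address.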
Pages: 712-718
Page count: 7
Related papers
18 in total
  • [1] Model selection and score normalization for text-dependent single utterance speaker verification
    Buyuk, Osman
    Arslan, Mustafa Levent
    TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2012, 20 : 1277 - 1295
  • [2] Robust Session Variability Compensation for SVM Speaker Verification
    Seo, Hyunson
    Jung, Chi-Sang
    Kang, Hong-Goo
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (06) : 1631 - 1641
  • [3] Robust Training for Speaker Verification against Noisy Labels
    Fang, Zhihua
    He, Liang
    Ma, Hanhan
    Guo, Xiaochen
    Li, Lin
    INTERSPEECH 2023, 2023, : 3192 - 3196
  • [4] Phoneme dependent inter-session variability reduction for speaker verification
    Lu, Haoze
    Zhang, Wenbin
    Horiuchi, Yasuo
    Kuroiwa, Shingo
    INTERNATIONAL JOURNAL OF BIOMETRICS, 2015, 7 (02) : 83 - 96
  • [5] Intrinsic Variation Robust Speaker Verification based on Sparse Representation
    Nie, Yi
    Xu, Mingxing
    Xianyu, Haishu
    2014 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2014,
  • [6] FURTHER OPTIMISATIONS OF CONSTANT Q CEPSTRAL PROCESSING FOR INTEGRATED UTTERANCE AND TEXT-DEPENDENT SPEAKER VERIFICATION
    Delgado, Hector
    Todisco, Massimiliano
    Sahidullah, Md
    Sarkar, Achintya K.
    Evans, Nicholas
    Kinnunen, Tomi
    Tan, Zheng-Hua
    2016 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2016), 2016, : 179 - 185
  • [7] Fusion of SNR-Dependent PLDA Models for Noise Robust Speaker Verification
    Pang, Xiaomin
    Mak, Man-Wai
    2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 619 - 623
  • [8] Robust discriminative training against data insufficiency in PLDA-based speaker verification
    Rohdin, Johan
    Biswas, Sangeeta
    Shinoda, Koichi
    COMPUTER SPEECH AND LANGUAGE, 2016, 35 : 32 - 57
  • [9] Acoustic Factor Analysis based Universal Background Model for Robust Speaker Verification in Noise
    Hasan, Taufiq
    Hansen, John H. L.
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3126 - 3130
  • [10] Analysis of Deep Generative Model Impact on Feature Extraction and Dimension Reduction for Short Utterance Text-Independent Speaker Verification
    Farhadipour, Aref
    Veisi, Hadi
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2024, 43 (7) : 4547 - 4564