Experiments in SVM-based Speaker Verification Using Short Utterances

Cited by: 0
Authors
McLaren, Mitchell [1]
Vogt, Robbie [1]
Baker, Brendan [1]
Sridharan, Sridha [1]
Affiliations
[1] Queensland Univ Technol, Speech & Audio Res Lab, Brisbane, Qld, Australia
Source
ODYSSEY 2010: THE SPEAKER AND LANGUAGE RECOGNITION WORKSHOP | 2010
Funding
Australian Research Council
Keywords
SUPPORT VECTOR MACHINES; RECOGNITION; COMPENSATION;
DOI
Not available
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
This paper investigates the effects of limited speech data on speaker verification using the Gaussian mixture model (GMM) mean supervector support vector machine (SVM) classifier. This classifier provides state-of-the-art performance when sufficient speech is available; however, its robustness to limited speech resources has not yet been ascertained. Verification performance is analysed with regard to the duration of the impostor utterances used in the background, score normalisation and session compensation training cohorts. Results highlight the importance of matching the speech duration of utterances in these cohorts to the expected evaluation conditions. Performance was shown to be particularly sensitive to the utterance duration of examples in the background dataset. It was also found that the nuisance attribute projection (NAP) approach to session compensation often degrades performance when both training and testing data are limited. An analysis of the session and speaker variability in the mean supervector space provides some insight into the cause of this phenomenon.
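For readers unfamiliar with NAP, the following is a minimal sketch of the commonly described formulation, not the configuration used in the paper, whose details the abstract does not give. It assumes that session variability is estimated as the dominant directions of within-speaker scatter among training supervectors, and that those directions are then projected out. The function names `nap_basis` and `apply_nap` and their parameters are hypothetical.

```python
import numpy as np


def nap_basis(supervectors, speaker_ids, num_directions):
    """Estimate an orthonormal basis of the dominant within-speaker directions.

    Assumption: each row is one utterance's supervector, and rows sharing a
    speaker id differ only through session (nuisance) effects.
    """
    X = np.asarray(supervectors, dtype=float)
    ids = np.asarray(speaker_ids)

    # Remove each speaker's mean so that only within-speaker variation remains.
    centered = np.empty_like(X)
    for spk in np.unique(ids):
        mask = ids == spk
        centered[mask] = X[mask] - X[mask].mean(axis=0)

    # The right singular vectors of the centered data are the eigenvectors of
    # the within-speaker scatter matrix, ordered by decreasing variance.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return vt[:num_directions].T  # shape: (dim, num_directions)


def apply_nap(supervectors, basis):
    """Project out the nuisance subspace: x -> (I - U U^T) x for each row x."""
    X = np.asarray(supervectors, dtype=float)
    return X - (X @ basis) @ basis.T
```

In this formulation the basis is only as reliable as the within-speaker scatter estimate it comes from, and the compensated supervectors would then be passed to the SVM in place of the raw ones.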
Pages: 83-90
Number of pages: 8