Score-Aging Calibration for Speaker Verification

Cited by: 13
Authors
Kelly, Finnian [1]
Hansen, John H. L. [1]
Affiliations
[1] Univ Texas Dallas, Ctr Robust Speech Syst, Richardson, TX 75080 USA
Funding
National Science Foundation (USA)
Keywords
Aging; calibration; quality measures; speaker variability; speaker verification; RECOGNITION;
DOI
10.1109/TASLP.2016.2602542
Chinese Library Classification (CLC) number
O42 [Acoustics]
Subject classification codes
070206; 082403
Abstract
The gradual changes that occur in the human voice due to aging create challenges for speaker verification. This study presents an approach to calibrating the output scores of a speaker verification system using the time interval between comparison samples as additional information. Several functions are proposed for the incorporation of this time information, which is viewed as aging information, in a conventional linear score calibration transformation. Experiments are presented on data with short-term aging intervals ranging between 2 months and 3 years, and long-term aging intervals of up to 30 years. The aging calibration proposal is shown to offset the decreased discrimination and calibration performance for both short- and long-term intervals, and to extrapolate well to unseen aging intervals. Relative reductions in C_llr (cost of log-likelihood ratio) of 1-4% and 10-43% are obtained at short- and long-term intervals, respectively. Assuming that a system has knowledge of the time interval between samples under comparison, this approach represents a straightforward means of compensating for the detrimental impact of aging on speaker verification performance.
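As a rough illustration of the idea in the abstract, the sketch below adds a time-interval term to a linear score calibration and evaluates the result with C_llr. The abstract does not say which aging functions the authors propose, so the linear-in-time term, the function names `calibrate` and `cllr`, and the weights `w0`, `w1`, `w2` are all assumptions made for this example, not the paper's method. The C_llr expression is the standard definition from Brümmer and du Preez (2006), with log-likelihood ratios in natural-log units.

```python
import math


def calibrate(score, dt_years, w0, w1, w2):
    """Linear score calibration with an extra aging term.

    Assumption: aging enters as a term linear in the time interval
    dt_years. The paper proposes several functions; this one is
    chosen only for illustration.
    """
    return w0 + w1 * score + w2 * dt_years


def cllr(target_llrs, nontarget_llrs):
    """Cost of log-likelihood ratio for natural-log LLRs."""

    def log2_1p_exp(x):
        # Numerically stable log2(1 + e^x).
        return (max(x, 0.0) + math.log1p(math.exp(-abs(x)))) / math.log(2)

    # Target trials are penalised for low LLRs, non-target trials for high LLRs.
    c_tar = sum(log2_1p_exp(-llr) for llr in target_llrs) / len(target_llrs)
    c_non = sum(log2_1p_exp(llr) for llr in nontarget_llrs) / len(nontarget_llrs)
    return 0.5 * (c_tar + c_non)
```

In practice the weights would be fitted on held-out trials, for example by logistic regression, rather than set by hand. A system that always outputs an LLR of zero gets a C_llr of 1, which is the usual reference point for an uninformative system.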
Pages: 2414-2424
Number of pages: 11