Speaker age estimation using i-vectors

被引:51
作者
Bahari, Mohamad Hasan [1 ]
McLaren, Mitchell [2 ]
Hugo Van Hamme [1 ]
van Leeuwen, David A. [2 ]
机构
[1] Katholieke Univ Leuven, Ctr Proc Speech & Images, Louvain, Belgium
[2] Radboud Univ Nijmegen, Ctr Language & Speech Technol, NL-6525 ED Nijmegen, Netherlands
关键词
Speaker age estimation; i-vector; Least squares support vector regression; Utterance length; Language mismatch; GENDER RECOGNITION; GMM SUPERVECTORS; SUPPORT;
D O I
10.1016/j.engappai.2014.05.003
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, a new approach for age estimation from speech signals based on i-vectors is proposed. In this method, each utterance is modeled by its corresponding i-vector. Then, a Within-Class Covariance Normalization technique is used for session variability compensation. Finally, a least squares support vector regression (LSSVR) is applied to estimate the age of speakers. The proposed method is trained and tested on telephone conversations of the National Institute for Standard and Technology (NIST) 2010 and 2008 speaker recognition evaluation databases. Evaluation results show that the proposed method yields significantly lower mean absolute error and higher Pearson correlation coefficient between chronological speaker age and estimated speaker age compared to different conventional schemes. The obtained relative improvements of mean absolute error and correlation coefficient compared to our best baseline system are around 5% and 2% respectively. Finally, the effect of some major factors influencing the proposed age estimation system, namely utterance length and spoken language are analyzed. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:99 / 108
页数:10
相关论文
共 49 条
[1]  
[Anonymous], P INTERSPEECH
[2]  
[Anonymous], 2006, PERCEPTION ANAL SYNT
[3]  
[Anonymous], P INT EUR
[4]  
[Anonymous], 2011, 2011 IEEE WORKSH BIO, DOI DOI 10.1109/BIOMS.2011.6052385
[5]  
[Anonymous], 1999, P ICPHS
[6]  
[Anonymous], P INT
[7]  
[Anonymous], P INTERSPEECH
[8]  
[Anonymous], P AM STAT ASS STAT C
[9]  
[Anonymous], 1966, Applied regression analysis
[10]  
[Anonymous], 2010, P 11 ANN C INT SPEEC