Automatic Detection of High Vocal Effort in Telephone Speech

Cited by: 0
Authors
Pohjalainen, Jouni [1]
Raitio, Tuomo [1]
Pulakka, Hannu [1]
Alku, Paavo [1]
Affiliations
[1] Aalto Univ, Dept Signal Proc & Acoust, Espoo, Finland
Source
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3 | 2012
Keywords
vocal effort detection; speech analysis
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
A system is proposed for the automatic detection of high vocal effort in speech. The system is evaluated using both PCM-coded speech and AMR-coded telephone speech. In addition, the effect of far-end noise in the telephone conditions is studied using both matched-condition training and cases with additive noise mismatch. The proposed system is based on Bayesian classification of mel-frequency cepstral feature vectors. Concerning the MFCC feature extraction process, the substitution of a spectrum analysis method emphasizing the fine structure improves the results in the noisy cases.
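The classification step named in the abstract can be illustrated with a minimal sketch. It is not the authors' system: it assumes a single diagonal-covariance Gaussian per class, equal class priors, and synthetic 13-dimensional vectors standing in for MFCC frames. The function names, class labels, and all numeric values are hypothetical, and no codec, noise, or feature-extraction stage is modeled.

```python
import math
import random


def fit_diag_gaussian(vectors):
    """Estimate per-dimension mean and variance from a list of feature vectors."""
    n = len(vectors)
    dim = len(vectors[0])
    mean = [sum(v[d] for v in vectors) / n for d in range(dim)]
    # Floor the variance so the log-likelihood stays finite.
    var = [max(sum((v[d] - mean[d]) ** 2 for v in vectors) / n, 1e-6)
           for d in range(dim)]
    return mean, var


def log_likelihood(x, model):
    """Log density of one frame under a diagonal-covariance Gaussian."""
    mean, var = model
    return sum(-0.5 * (math.log(2 * math.pi * s) + (xi - m) ** 2 / s)
               for xi, m, s in zip(x, mean, var))


def classify(frames, models, priors):
    """Pick the class with the highest log posterior, treating frames as independent."""
    scores = {}
    for label, model in models.items():
        scores[label] = math.log(priors[label]) + sum(
            log_likelihood(f, model) for f in frames)
    return max(scores, key=scores.get)


def synthetic_frames(rng, center, n, dim=13):
    """Hypothetical stand-in for MFCC frames: unit-variance noise around `center`."""
    return [[rng.gauss(center, 1.0) for _ in range(dim)] for _ in range(n)]


rng = random.Random(0)
train = {
    "normal": synthetic_frames(rng, 0.0, 200),
    "high_effort": synthetic_frames(rng, 1.5, 200),
}
models = {label: fit_diag_gaussian(frames) for label, frames in train.items()}
priors = {"normal": 0.5, "high_effort": 0.5}  # assumed equal priors

test_utterance = synthetic_frames(rng, 1.5, 50)
predicted = classify(test_utterance, models, priors)
print(predicted)
```

Summing per-frame log-likelihoods and adding the log prior is Bayes' rule applied under a frame-independence assumption. A real detector would replace the synthetic vectors with MFCCs computed from speech and would likely use a richer class-conditional model.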
Pages: 690-693
Page count: 4
Related papers
22 in total
[1]  
[Anonymous], 2011, 26090 3GPP TS
[2]  
[Anonymous], 2007, P IEEE INT C ADV VID
[3]  
[Anonymous], 2010, G191 ITUT
[4]  
[Anonymous], P INTERSPEECH
[5]  
Bengio S., 2004, P ODYSSEY04 TOL SPAI
[6]  
Harwardt C., 2011, P INTERSPEECH
[7]   THE LOMBARD REFLEX AND ITS ROLE ON HUMAN LISTENERS AND AUTOMATIC SPEECH RECOGNIZERS [J].
JUNQUA, JC .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1993, 93 (01) :510-524
[8]   A New Initialization Technique for Generalized Lloyd Iteration [J].
Katsavounidis, Ioannis ;
Kuo, C. -C. Jay ;
Zhang, Zhen .
IEEE SIGNAL PROCESSING LETTERS, 1994, 1 (10) :144-146
[9]  
Keronen S., 2011, P INTERSPEECH
[10]   Effect of vocal effort on spectral properties of vowels [J].
Liénard, JS ;
Di Benedetto, MG .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1999, 106 (01) :411-422