Towards robust automatic evaluation of pathologic telephone speech

Cited by: 14
Authors
Riedhammer, K. [1]
Stemmer, G. [2]
Haderlein, T. [1, 3]
Schuster, M. [3]
Rosanowski, F. [3]
Noeth, E. [1]
Maier, A. [1, 3]
Affiliations
[1] Univ Erlangen Nurnberg, Lehrstuhl Mustererkennung, Martensstr 3, D-91058 Erlangen, Germany
[2] Corp Technol, Siemens AG, CT IC5, D-81730 Munich, Germany
[3] Univ Erlangen Nurnberg, Abteilung Phoniatrie & Padaudiol, D-91054 Erlangen, Germany
Source
2007 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, VOLS 1 AND 2 | 2007
Keywords
biomedical acoustics; speech intelligibility; speech processing; acoustic applications;
DOI
10.1109/ASRU.2007.4430200
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
For many aspects of speech therapy, an objective evaluation of the intelligibility of a patient's speech is needed. We investigate the evaluation of speech intelligibility by means of automatic speech recognition. Previous studies have shown that measures like word accuracy are consistent with human experts' ratings. To ease the patient's burden, it is highly desirable to conduct the assessment via phone. However, the telephone channel degrades the quality of the speech signal, which negatively affects the results. To reduce inaccuracies, we propose a combination of two speech recognizers. Experiments on two sets of pathological speech show that the combination results in consistent improvements in the correlation between the automatic evaluation and the ratings by human experts. Furthermore, the approach leads to reductions of 10% and 25% of the maximum error of the intelligibility measure.
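The abstract relies on two quantities: word accuracy of a recognizer's output and its correlation with expert ratings. The following is a minimal sketch of how these could be computed, not the authors' implementation. It assumes the common definition of word accuracy, 100 * (N - S - D - I) / N over a Levenshtein alignment, and assumes Pearson's correlation coefficient as the agreement measure. The function names `word_accuracy` and `pearson` are hypothetical. The combination of two recognizers is not described in the abstract and is therefore not sketched here.

```python
from math import sqrt


def word_accuracy(reference: str, hypothesis: str) -> float:
    """Word accuracy in percent, assuming WA = 100 * (N - S - D - I) / N.

    N is the number of reference words; S + D + I is the minimum number of
    substitutions, deletions and insertions (word-level edit distance).
    """
    ref, hyp = reference.split(), hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] holds the edit distance between ref[:i-1] and hyp[:j]
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        cur = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cur.append(min(
                prev[j] + 1,                           # deletion
                cur[j - 1] + 1,                        # insertion
                prev[j - 1] + (ref_word != hyp_word),  # match or substitution
            ))
        prev = cur
    return 100.0 * (len(ref) - prev[-1]) / len(ref)


def pearson(xs: list[float], ys: list[float]) -> float:
    """Pearson correlation coefficient between two equally long sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x, mean_y = sum(xs) / len(xs), sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)
```

Per speaker, one would compute `word_accuracy` on the recognizer output and then pass the per-speaker values, together with the experts' ratings, to `pearson`. Note that word accuracy can become negative when the hypothesis contains many insertions.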
Pages: 717 / +
Number of pages: 2
Related papers
14 in total
[1]  
[Anonymous], 2005, Data Mining: Practical Machine Learning Tools and Techniques
[2]   A post-processing system to yield reduced word error rates: Recognizer output voting error reduction (ROVER) [J].
Fiscus, JG .
1997 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, PROCEEDINGS, 1997, :347-354
[3]  
GIULIANI D, 2006, TC STAR WORKSH SPEEC, P151
[4]  
Huan Liu, 1996, Machine Learning. Proceedings of the Thirteenth International Conference (ICML '96), P319
[5]  
Likert R, 1932, ARCH PSYCHOL, V140
[6]  
MAIER A, 2007, IN PRESS INT 2007 P
[7]  
Maier A., 2006, Proc. of the 5th Slovenian and 1st International Conference on Language Technologies, P31
[8]   On lines and planes of closest fit to systems of points in space. [J].
Pearson, Karl .
PHILOSOPHICAL MAGAZINE, 1901, 2 (7-12) :559-572
[9]  
RIEDHAMMER K, 2006, P 5 SLOV 1 INT C LAN, P17
[10]  
Schuster M, 2005, INT CONF ACOUST SPEE, P61