Speech recognition performance as an effective perceived quality predictor

Cited by: 11
Authors
Jiang, WY [1]
Schulzrinne, H [1]
Affiliation
[1] Columbia Univ, Dept Comp Sci, New York, NY 10027 USA
Source
2002 Tenth IEEE International Workshop on Quality of Service | 2002
Keywords
perceived quality; speech recognition; packet audio; Internet telephony; subjective listening test; speech intelligibility; quality of service
DOI
10.1109/IWQoS.2002.1006595
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Determining the perceived quality of packet audio under packet loss usually requires human-based Mean Opinion Score (MOS) listening tests. We propose a new MOS estimation method based on machine speech recognition. Its automated, machine-based nature facilitates real-time monitoring of transmission quality without the need to conduct time-consuming listening tests. Our evaluation of this new method shows that it can use the word recognition ratio metric to reliably predict perceived quality. In particular, we find that although the absolute word recognition ratio of a speech recognizer may vary depending on the speaker, the relative word recognition ratio, obtained by dividing the absolute word recognition ratio by its own value at 0% loss, is speaker-independent. Therefore, the relative word recognition ratio is well suited as a universal, speaker-independent MOS predictor. Finally, we have also conducted human-based word recognition tests and examined their relationship with machine-based recognition results. Our analysis shows that the two are correlated, although not very linearly. We also find that the human-based word recognition ratio does not degrade significantly once packet loss is large (greater than or equal to 10%).
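The normalization described in the abstract can be sketched in a few lines of Python. This is only an illustration of the stated definition (absolute word recognition ratio divided by the same speaker's value at 0% loss), not the authors' implementation; the function name, the error check, and the sample numbers are all assumptions made for this example.

```python
def relative_word_recognition_ratio(absolute_ratio: float, baseline_ratio: float) -> float:
    """Normalize a speaker's word recognition ratio under packet loss
    by that same speaker's ratio at 0% loss (the baseline)."""
    if baseline_ratio <= 0:
        # Assumption: a recognizer that recognizes nothing at 0% loss
        # gives no usable baseline, so we reject it.
        raise ValueError("baseline ratio at 0% loss must be positive")
    return absolute_ratio / baseline_ratio


# Hypothetical numbers for illustration only; they are not taken from the paper.
# Two speakers with different absolute ratios at the same loss rate.
speaker_a = relative_word_recognition_ratio(absolute_ratio=0.45, baseline_ratio=0.90)
speaker_b = relative_word_recognition_ratio(absolute_ratio=0.35, baseline_ratio=0.70)
print(speaker_a, speaker_b)
```

In this made-up case the absolute ratios differ between the two speakers, but the relative ratios coincide, which is the kind of speaker independence the abstract reports for the relative metric.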
Pages: 269-275
Page count: 7