Assessment of objective quality measures for speech intelligibility estimation

被引:0
作者
Liu, Wei M. [1 ]
Jellyman, Keith A. [1 ]
Mason, John S. D. [1 ]
Evans, Nicholas W. D. [1 ]
机构
[1] Univ Coll Swansea, Sch Engn, Swansea SA2 8PP, W Glam, Wales
来源
2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13 | 2006年
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper investigates the accuracy of automatic speech recognition (ASR) and 6 other well-reported objective quality measures for the task of estimating speech intelligibility. It is believed to be the first assessment of such a range of measures side-by-side and in the context of intelligibility. A total of 39 degradation conditions including those from a newly proposed low bit rate (0.3 to 1.5kbps) codec and a noise suppression system are considered. They provide real and varied scenarios to assess the measures. The objective scores are compared to subjective listening scores, and their correlation used to assess the approach. All tests are conducted on the European standard Aurora 2 corpus. Experiments show that ASR and perceptual estimation of speech quality (PESQ) are potentially reliable estimators of intelligibility with subjective correlation as high as 0.99 and 0.96 respectively. Furthermore, ASR gives a trend corresponding to that of subjective intelligibility assessment for the different configurations of the new codec, while most others fail.
引用
收藏
页码:1225 / 1228
页数:4
相关论文
共 17 条
[1]  
[Anonymous], 2001, P862 ITUT
[2]  
BEERENDS JG, 2002, J AUDIO ENG SOC, V50
[3]  
Chernick C. M., 1999, IEEE INT MIL COMM C
[4]   FACTORS GOVERNING THE INTELLIGIBILITY OF SPEECH SOUNDS [J].
FRENCH, NR ;
STEINBERG, JC .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1947, 19 (01) :90-119
[5]  
HIRSCH HG, 2000, ISCA ITRW ASR 2000
[6]  
HOLUB J, 2003, MEASUREMENT SPEECH A, P47
[7]  
HOUTGAST T, 1973, ACUSTICA, V28, P66
[8]   Channel and source considerations of a bit-rate reduction technique for a possible wireless communications system's performance enhancement [J].
Ilk, HG ;
Tugaç, S .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2005, 4 (01) :93-99
[9]  
ITU-T Rec. P.800, 1996, P800 ITUT
[10]   Speech recognition performance as an effective perceived quality predictor [J].
Jiang, WY ;
Schulzrinne, H .
2002 TENTH IEEE INTERNATIONAL WORKSHOP ON QUALITY OF SERVICE, 2002, :269-275