EXTENSION OF THE E-MODEL TOWARDS SUPER-WIDEBAND SPEECH TRANSMISSION

Cited by: 10
Authors
Waeltermann, Marcel [1]
Tucker, Izabela [1]
Raake, Alexander [1]
Moeller, Sebastian [1]
Affiliations
[1] TU Berlin, Deutsche Telekom Labs, Quality & Usability Lab, Berlin, Germany
Source
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2010
Keywords
Modeling; Speech codecs; Linear systems; Nonlinear systems; Nonlinear distortion;
DOI
10.1109/ICASSP.2010.5495199
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification codes
070206 ; 082403 ;
Abstract
In this paper, the quality gain of super-wideband (SWB) speech, transmitted in the much wider frequency range of 50-14000 Hz compared to the standard 300-3400 Hz narrowband, is quantified within the E-model framework, a parametric tool for speech quality prediction. Based on two listening experiments, a linear extrapolation of the E-model transmission rating scale was found that leads to a maximum quality advantage of 39% relative to wideband (50-7000 Hz) transmission and 79% relative to narrowband. As a result, narrowband, wideband, and super-wideband conditions can all be quantified on one universal quality scale. Equipment Impairment Factors are derived and discussed for several SWB codecs. It is further shown that a model quantifying the quality impact of linear distortions, reflected by the Bandwidth Impairment Factor, can be successfully applied to SWB conditions. The correlation between the overall impairment and the model predictions amounts to r = 0.977 for linearly distorted speech samples.
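The two percentages in the abstract can be turned into scale maxima with a little arithmetic. The sketch below is only an illustration under one assumption that the abstract does not state: that the narrowband transmission rating scale tops out at R = 100. The function name `extended_scale_maxima` and the constants are hypothetical and are not taken from the paper.

```python
# Hedged sketch: derive scale maxima from the relative gains quoted in the abstract.
# ASSUMPTION (not in the abstract): the narrowband R scale has a maximum of 100.
R_MAX_NB = 100.0

# Relative quality advantages of super-wideband, as quoted in the abstract.
GAIN_SWB_OVER_NB = 0.79  # 79% relative to narrowband
GAIN_SWB_OVER_WB = 0.39  # 39% relative to wideband


def extended_scale_maxima(r_max_nb: float = R_MAX_NB) -> dict:
    """Return the implied maximum R value per bandwidth class."""
    # SWB maximum follows directly from the gain over narrowband.
    r_max_swb = r_max_nb * (1.0 + GAIN_SWB_OVER_NB)
    # WB maximum is implied by requiring SWB to sit 39% above it.
    r_max_wb = r_max_swb / (1.0 + GAIN_SWB_OVER_WB)
    return {"NB": r_max_nb, "WB": r_max_wb, "SWB": r_max_swb}


maxima = extended_scale_maxima()
for band, r_max in maxima.items():
    print(f"{band}: implied maximum R of about {r_max:.1f}")
```

Under that assumption, the two gains are mutually consistent only if the wideband maximum lies a little below 130, which gives a quick sanity check on the quoted figures.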
Pages: 4654-4657
Number of pages: 4