Speech Quality Estimation Models and trends

被引:91
作者
Moeller, Sebastian [1 ]
Chan, Wai-Yip [2 ,3 ]
Cote, Nicolas [1 ,4 ]
Falk, Tiago H. [5 ]
Raake, Alexander [1 ,6 ,7 ]
Waeltermann, Marcel
机构
[1] Tech Univ Berlin, Deutsch Telekom Labs, D-1000 Berlin, Germany
[2] McGill Univ, Montreal, PQ H3A 2T5, Canada
[3] IIT, Chicago, IL 60616 USA
[4] France Telecom R&D, Lannion, France
[5] Inst Natl Rech Sci INRS EMT, Montreal, PQ, Canada
[6] Ecole Polytech Fed Lausanne, Lausanne, Switzerland
[7] LIMSI CNRS, Paris, France
关键词
D O I
10.1109/MSP.2011.942469
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This article presents a tutorial overview of models for estimating the quality experienced by users of speech transmission and communication services. Such models can be classified as either parametric or signal based. Signal-based models use input speech signals measured at the electrical or acoustic interfaces of the transmission channel. Parametric models, on the other hand, depend on signal and system parameters estimated during net work planning or at run time. This tutorial describes the under lying principles as well as advantages and limitations of existing models. It also presents new developments, thus serving as a guide to an appropriate usage of the multitude of current and emerging speech quality models. © 2011 IEEE.
引用
收藏
页码:18 / 28
页数:11
相关论文
共 64 条
[1]  
*AM NAT STAND I, 2006, ATISPP01000052006 AM
[2]  
[Anonymous], 2003, P835 ITUT
[3]  
[Anonymous], 1988, Objective measures of speech quality
[4]  
[Anonymous], 1996, Recommendation ITU-T P.800
[5]  
Appel R, 2002, J AUDIO ENG SOC, V50, P237
[6]   Estimation of 'quality per call' modelled telephone conversations [J].
Berger, J. ;
Hellenbart, A. ;
Ullmann, R. ;
Weiss, B. ;
Moeller, S. ;
Gustafsson, J. ;
Heikkila, G. .
2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, :4809-4812
[7]  
Blauert J., 1997, Spatial hearing: the psychophysics of human sound localization
[8]   VoIP quality assessment: Taking account of the edge-device [J].
Broom, Simon R. .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (06) :1977-1983
[9]  
Claret A., 2001, Physical processes in close binary systems, P1
[10]  
Côté N, 2010, 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, P1317