Quality Estimation of Noisy Speech Using Spectral Entropy Distance

被引:0
作者
Mittag, Gabriel [1 ]
Moeller, Sebastian [1 ,2 ]
机构
[1] Tech Univ Berlin, Qual & Usabil Lab, Berlin, Germany
[2] DFKI, Language Technol, Berlin, Germany
来源
2019 26TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS (ICT) | 2019年
关键词
speech quality; spectral entropy; QoE; QoS; noise;
D O I
10.1109/ict.2019.8798783
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we propose to use spectral entropy distance as a new measure for objective quality estimations of noisy speech. While the perceived quality estimation of a transmitted speech signal under background noise is fairly straight forward, the estimation of noise on active speech is more complex. For example, an increase in loudness can be confused as noise by common quality measures. Also, other distortions, such as interruptions due to packet loss, can decrease the energy in the degraded signal and thus lead to an underestimation of the noisiness. This is especially critical when the noise is only present during active speech segments, as it is the case for quantization noise caused by low bitrate codecs or voice activity detections at the receiver side. The spectral entropy, however, only considers the frequency composition of a signal and does not depend on the signal energy. Therefore, it gives a robust measure of how noisy a signal is in the presence of active speech. In our experiments, we trained a prediction model based on the spectral entropy and obtained excellent prediction results that show that the spectral entropy distance is indeed a useful tool for the quality estimation of noisy speech.
引用
收藏
页码:197 / 201
页数:5
相关论文
共 19 条
  • [1] [Anonymous], PSOPHOMETER USE TELE
  • [2] [Anonymous], METHODS METRICS PROC
  • [3] [Anonymous], 1988, Objective measures of speech quality
  • [4] [Anonymous], 1972, BELL SYSTEM TECHNICA
  • [5] [Anonymous], P800 ITUT
  • [6] [Anonymous], 103281 ETSI TS
  • [7] [Anonymous], MEASUREMENT AUDIO FR
  • [8] [Anonymous], SUBJECTIVE TEST METH
  • [9] [Anonymous], MODULATED NOISE REFE
  • [10] Beerends JG, 2013, J AUDIO ENG SOC, V61, P366