Speech quality evaluation: A new application of digital watermarking

被引:7
作者
Cai, Libin [1 ]
Tu, Ronghui [1 ]
Zhao, Jiying [1 ]
Mao, Yongyi [1 ]
机构
[1] Univ Ottawa, Sch Informat Technol & Engn, Ottawa, ON K1N 6N5, Canada
关键词
mean opinion score (MOS); perceptual evaluation of speech quality (PESQ); speech quality evaluation; watermarking;
D O I
10.1109/TIM.2006.887773
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Speech quality evaluation is an important research topic. The traditional way for speech quality evaluation is using subjective tests. They are reliable, but very expensive, time consuming, and cannot be used in certain applications such as online monitoring. Objective models, based on human perception, were developed to predict the results of subjective tests. The existing objective methods require either the original speech or complicated computation model, which makes some applications of quality evaluation impossible. In this paper, we propose a novel speech quality evaluation method using digital watermarking. Our algorithm evaluates the speech quality without the need of reference speech or any computational model. The watermark is embedded in the discrete wavelet domain or temporal domain. of a speech signal by using quantization technique. This algorithm can evaluate perceptual quality of speech that is distorted by Gaussian noise, MP3 compression, low-pass filtering, and packet loss. The experimental results show that the method yields accurate quality scores which are very close to the results of the perceptual evaluation of speech quality.
引用
收藏
页码:45 / 55
页数:11
相关论文
共 16 条
  • [1] CAI L, 2005, P IMTC2005 OTT ON CA, P726
  • [2] Dither modulation: a new approach to digital watermarking and information embedding
    Chen, B
    Wornell, GW
    [J]. SECURITY AND WATERMARKING OF MULTIMEDIA CONTENTS, 1999, 3657 : 342 - 353
  • [3] DING L, 2003, P IEEE GLOB C DEC, V7, P3974
  • [4] Assessment of effects of packet loss on speech quality in VoIP
    Ding, LJ
    Goubran, RA
    [J]. 2ND IEEE INTERNATIONAL WORKSHOP ON HAPTIC, AUDIO AND VISUAL ENVIRONMENTS AND THEIR APPLICATIONS - HAVE 2003, 2003, : 49 - 54
  • [5] FALK T, 2004, P 22 BIENN S COMM KI, P169
  • [6] Falk TH, 2005, INT CONF ACOUST SPEE, P125
  • [7] Quantization
    Gray, RM
    Neuhoff, DL
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 1998, 44 (06) : 2325 - 2383
  • [8] HOENE C, 2004, SPECTS 04 SAN JOS CA
  • [9] *ITU T STUD GROUP, 1996, 1274 ITUT STUD GROUP
  • [10] Kim DS, 2004, 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PROCEEDINGS, P1060