ADAPTIVE POSTFILTERING FOR QUALITY ENHANCEMENT OF CODED SPEECH

Cited by: 99
Authors
CHEN, JH
GERSHO, A
Affiliations
[1] Speech Coding Research Department, AT&T Bell Laboratories, Murray Hill, NJ
[2] Center for Information Processing Research, Department of Electrical and Computer Engineering, University of California, Santa Barbara
Source
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 1995 / Vol. 3 / No. 1
DOI
10.1109/89.365380
Chinese Library Classification number
O42 [Acoustics];
Subject classification code
070206 ; 082403 ;
Abstract
An adaptive postfiltering algorithm for enhancing the perceptual quality of coded speech is presented. The postfilter consists of a long-term postfilter section in cascade with a short-term postfilter section and includes spectral tilt compensation and automatic gain control. The long-term section emphasizes pitch harmonics and attenuates the spectral valleys between pitch harmonics. The short-term section, on the other hand, emphasizes speech formants and attenuates the spectral valleys between formants. Both filter sections have poles and zeros. Unlike earlier postfilters that often introduced a substantial amount of muffling to the output speech, our postfilter significantly reduces this effect by minimizing the spectral tilt in its frequency response. As a result, this postfilter achieves noticeable noise reduction while introducing only minimal distortion in speech. The complexity of the postfilter is quite low. Variations of this postfilter are now being used in several national and international speech coding standards. This paper presents for the first time a complete description of our original postfiltering algorithm and the underlying ideas that motivated its development.
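The abstract gives no equations, so the following is only a minimal sketch of how a short-term pole-zero postfilter with tilt compensation and gain control might be structured. It assumes a transfer function of the form A(z/beta) / A(z/alpha) followed by a first-order tilt term (1 - mu z^-1), which is a common way to describe this kind of postfilter and is not quoted from this record. The function names, the parameter values (alpha, beta, mu), and the block-level energy matching are illustrative assumptions; the long-term (pitch) section is omitted.

```python
import math


def bandwidth_expand(a, gamma):
    """Return the coefficients of A(z/gamma): coefficient i is scaled by gamma**i."""
    return [c * gamma ** i for i, c in enumerate(a)]


def iir_filter(b, a, x):
    """Apply y[n] = sum_k b[k] x[n-k] - sum_{k>=1} a[k] y[n-k], assuming a[0] == 1."""
    y = []
    for n in range(len(x)):
        acc = sum(b[k] * x[n - k] for k in range(len(b)) if n - k >= 0)
        acc -= sum(a[k] * y[n - k] for k in range(1, len(a)) if n - k >= 0)
        y.append(acc)
    return y


def short_term_postfilter(x, lpc, alpha=0.8, beta=0.5, mu=0.5):
    """Pole-zero formant emphasis plus first-order tilt compensation.

    lpc holds [1, a1, ..., ap] for A(z) = 1 + a1 z^-1 + ... + ap z^-p.
    alpha, beta and mu are illustrative values (assumptions), with beta < alpha < 1.
    """
    num = bandwidth_expand(lpc, beta)   # zeros: A(z/beta)
    den = bandwidth_expand(lpc, alpha)  # poles: 1 / A(z/alpha)
    y = iir_filter(num, den, x)
    # Tilt compensation: (1 - mu z^-1) offsets the low-pass tilt of the pole-zero part.
    return iir_filter([1.0, -mu], [1.0], y)


def match_energy(reference, signal):
    """Simplified gain control: scale signal so its energy equals that of reference."""
    e_ref = sum(v * v for v in reference)
    e_sig = sum(v * v for v in signal)
    if e_sig == 0.0:
        return list(signal)
    gain = math.sqrt(e_ref / e_sig)
    return [gain * v for v in signal]


# Hypothetical first-order predictor and a short decoded frame, for illustration only.
decoded = [1.0, -0.5, 0.25, 0.0, 0.3, -0.2]
enhanced = match_energy(decoded, short_term_postfilter(decoded, [1.0, -0.9]))
```

Scaling the whole frame by a single gain keeps the sketch short; a practical gain control would more likely track the gain smoothly from sample to sample to avoid discontinuities at frame boundaries.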
Pages: 59-71
Page count: 13