THEORETICAL-ANALYSIS OF THE HIGH-RATE VECTOR QUANTIZATION OF LPC PARAMETERS

被引:116
作者
GARDNER, WR
RAO, BD
机构
[1] QUALCOMM Incorporated, University of California, San Diego, San Diego, CA 92121 San Diego, CA 92093-
[2] University of California at San Diego, San Diego
来源
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 1995年 / 3卷 / 05期
基金
美国国家科学基金会;
关键词
D O I
10.1109/89.466658
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper presents a theoretical analysis of high-rate vector quantization (VQ) systems that use suboptimal, mismatched distortion measures, and describes the application of the analysis to the problem of quantizing the linear predictive coding (LPC) parameters in speech coding systems, First, it is shown that in many high-rate VQ systems the quantization distortion approaches a simple quadratically weighted error measure, where the weighting matrix is a ''sensitivity matrix'' that is an extension of the concept of the scalar sensitivity. The approximate performance of VQ systems that train and quantize using mismatched distortion measures is derived, and is used to construct better distortion measures, Second, these results are used to determine the performance of LPC vector quantizers, as measured by the log spectral distortion (LSD) measure, which have been trained using other error measures, such as mean-squared (MSE) or weighted mean-squared error (WMSE) measures of LPC parameters, reflection coefficients and transforms thereof, and line spectral pair (LSP) frequencies, Computationally efficient algorithms for computing the sensitivity matrices of these parameters are described. In particular, it is shown that the sensivity matrix for the LSP frequencies is diagonal, implying that a WMSE measure of LSP frequencies converges to the LSD measure in high-rate VQ systems, Experimental results to support the theoretical performance estimates are provided.
引用
收藏
页码:367 / 381
页数:15
相关论文
共 23 条
  • [1] Viswanathan R., Makhoul J., Quantization properties of transmission parameters in linear predictive systems, IEEE Trans. Acoust., ASSP-23, pp. 309-321, (1975)
  • [2] Gray A., Markel J., Quantization and bit allocation in speech processing, IEEE Trans. Acoust., ASSP-24, (1976)
  • [3] Gray A., Gray R., Markel J., Comparison of optimal quantizations of speech reflection coefficients, IEEE Trans. Acoust., ASSP-25, pp. 9-23, (1977)
  • [4] Soong F.K., Juang B.-H., Line spectrum pair (LSP) and speech data compresion, Proc. ICASSP, pp. 1-4, (1984)
  • [5] Optimal quantization of LSP parameters, Proc. ICASSP, pp. 394-397, (1988)
  • [6] Optimal quantization of LSP parameters using delayed decisions, Proc. ICASSP, pp. 185-188, (1990)
  • [7] Atal B., Cox R., Kroon P., Spectral quantization and interpolation for CELP coders, Proc. ICASSP, pp. 69-72, (1989)
  • [8] Buzo A., Gray A.H., Gray R.M., Markel J.D., Speech coding based upon vector quantization, IEEE Trans. Acoust., pp. 562-574, (1980)
  • [9] Paliwal K.K., Atal B.S., Efficient vector quantization of LPC parameters at 24 bits/frame, Proc. ICASSP, pp. 661-664, (1991)
  • [10] Efficient vector quantization of LPC parameters at 24 bits/frame, IEEE Trans. Speech Audio Processing, 1, pp. 3-14, (1993)