MULTIBAND CODE-EXCITED LINEAR PREDICTION (MBCELP) FOR SPEECH CODING

被引:0
作者
YANG, G [1 ]
LEICH, H [1 ]
BOITE, R [1 ]
机构
[1] FAC POLYTECH MONS,THEORIE CIRCUITS & TRAITEMENT SIGNAUX LAB,31 BLVD DOLEZ,B-7000 MONS,BELGIUM
关键词
SPEECH CODING; LINEAR PREDICTIVE CODING; CELP CODING; SPEECH PERCEPTION; SIGNAL PROCESSING;
D O I
10.1016/0165-1684(93)90067-K
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper presents a new speech coding model targeted at the bit-rate above 4 kbit/s, referred to as multiband code-excited linear prediction (MBCELP). The analysis and synthesis of speech are accomplished in the time domain by comparing the original to the synthetic speech while a perceptual criterion is used. A usual short-term linear predictive filter is employed as the synthesis filter; the excitation signal is modelled as a linear combination of a long-term predictive excitation, periodic multiband excitations and a noise-like excitation; no voiced/unvoiced decision is required. The periodic multiband excitation is produced by convoluting a periodic impulse sequence with a sinc function corresponding to a frequency band; the noise-like excitation is represented by a codebook. We estimate a pitch which is appropriate not only to the long-term predictive filter but also to the periodic multiband excitations and to the 'pitch' prefilter in the decoder. Several CELP vocoders are developed as a reference to test the property of the MBCELP vocoder. Listening tests clearly indicate that this vocoder reconstructed very high quality speech without 'buzziness' or 'hoarseness' for both clean and noisy speech. A 4.8 kbit/s MBCELP vocoder is shown as an example. Its perceptual quality is virtually identical to the original 8 kbit/s CELP vocoder and the improved 7.2 kbit/s CELP vocoder. Since less subframes are used for the MBCELP vocoders, their complexity is not greater than that of usual CELP vocoders with the same type of codebook. A lot of techniques used to simplify CELP coding can be also adopted for the MBCELP coding.
引用
收藏
页码:215 / 227
页数:13
相关论文
共 22 条
[11]  
Kleijn, Krasinski, Ketchum, Improved speech quality and efficient vector quantization in SELP, Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., pp. 155-158, (1988)
[12]  
Kleijn, Krasinski, Krtchum, Fast method for the CELP speech coding algorithm, IEEE Trans. Acoust. Speech Signal Process., 38, pp. 1330-1342, (1990)
[13]  
Kroon, Atal, On the use of pitch predictors with high temporal resolution, IEEE Transactions on Signal Processing, 39, 3, (1991)
[14]  
Kroon, Deprettere, Sluyter, Regular-pulse excitation — A novel approach to effective and efficient multipulse coding of speech, IEEE Trans. Acoust. Speech Signal Process., 34 ASSP, 5, (1986)
[15]  
Markel, The SIFT algorithm for fundamental frequency estimation, IEEE Transactions on Audio and Electroacoustics, 20 AU, 5, pp. 367-377, (1972)
[16]  
Marques, Tribolet, Pitch prediction with fractional delays in CELP coding, Proc. EUROSPEECH, pp. 509-512, (1989)
[17]  
Salami, Binary code excited linear prediction (BCELP): New approach to CELP coding of speech without codebooks, Electronics Letters, 25, 6, (1989)
[18]  
Salami, Binary pulse excitation: A novel approach to low complexity CELP coding, Advances in Speech Coding, pp. 145-156, (1991)
[19]  
Shoham, Constrained-stochastic excitation coding of speech at 4.8 kb/s, Advances in Speech Coding, pp. 339-348, (1991)
[20]  
Southcott, Boyd, Coleman, Hammett, Low bit rate speech coding for practical applications, Br. Telecom. Technol. J., 6, 2, (1988)