MULTIBAND CODE-EXCITED LINEAR PREDICTION (MBCELP) FOR SPEECH CODING

Cited by: 0
Authors
YANG, G [1]
LEICH, H [1]
BOITE, R [1]
Affiliation
[1] FAC POLYTECH MONS,THEORIE CIRCUITS & TRAITEMENT SIGNAUX LAB,31 BLVD DOLEZ,B-7000 MONS,BELGIUM
Keywords
SPEECH CODING; LINEAR PREDICTIVE CODING; CELP CODING; SPEECH PERCEPTION; SIGNAL PROCESSING;
DOI
10.1016/0165-1684(93)90067-K
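Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology];
Subject classification codes
0808; 0809;
Abstract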
This paper presents a new speech coding model targeted at bit rates above 4 kbit/s, referred to as multiband code-excited linear prediction (MBCELP). The analysis and synthesis of speech are carried out in the time domain by comparing the original speech with the synthetic speech under a perceptual criterion. A conventional short-term linear predictive filter is employed as the synthesis filter; the excitation signal is modelled as a linear combination of a long-term predictive excitation, periodic multiband excitations and a noise-like excitation; no voiced/unvoiced decision is required. Each periodic multiband excitation is produced by convolving a periodic impulse sequence with a sinc function corresponding to a frequency band; the noise-like excitation is represented by a codebook. We estimate a pitch that is appropriate not only to the long-term predictive filter but also to the periodic multiband excitations and to the 'pitch' prefilter in the decoder. Several CELP vocoders are developed as references against which to test the properties of the MBCELP vocoder. Listening tests clearly indicate that this vocoder reconstructs very high-quality speech without 'buzziness' or 'hoarseness' for both clean and noisy speech. A 4.8 kbit/s MBCELP vocoder is shown as an example. Its perceptual quality is virtually identical to that of the original 8 kbit/s CELP vocoder and the improved 7.2 kbit/s CELP vocoder. Since fewer subframes are used for the MBCELP vocoders, their complexity is not greater than that of conventional CELP vocoders with the same type of codebook. Many of the techniques used to simplify CELP coding can also be adopted for MBCELP coding.
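The periodic multiband excitation described in the abstract can be illustrated with a minimal sketch. It assumes an 8 kHz sampling rate, a truncated ideal band-pass (difference-of-sincs) impulse response per band, and hypothetical band edges and gains; none of these values, nor the function names, come from the paper, and the authors' actual construction may differ.

```python
import math

def sinc(x):
    """Normalized sinc: sin(pi x) / (pi x), with sinc(0) = 1."""
    return 1.0 if x == 0 else math.sin(math.pi * x) / (math.pi * x)

def band_sinc(f_lo, f_hi, fs, half_len):
    """Truncated ideal band-pass impulse response for the band [f_lo, f_hi] Hz."""
    lo, hi = 2.0 * f_lo / fs, 2.0 * f_hi / fs
    return [hi * sinc(hi * n) - lo * sinc(lo * n)
            for n in range(-half_len, half_len + 1)]

def impulse_train(length, period):
    """Unit impulses at every multiple of the pitch period (in samples)."""
    return [1.0 if n % period == 0 else 0.0 for n in range(length)]

def convolve_same(x, h):
    """Linear convolution cropped to len(x), centred on the middle tap of h."""
    half = len(h) // 2
    out = [0.0] * len(x)
    for n, xn in enumerate(x):
        if xn == 0.0:
            continue
        for k, hk in enumerate(h):
            m = n + k - half
            if 0 <= m < len(x):
                out[m] += xn * hk
    return out

def multiband_excitation(length, period, bands, gains, fs=8000, half_len=20):
    """Gain-weighted sum of band-limited periodic excitations (assumed form)."""
    train = impulse_train(length, period)
    total = [0.0] * length
    for (f_lo, f_hi), gain in zip(bands, gains):
        component = convolve_same(train, band_sinc(f_lo, f_hi, fs, half_len))
        total = [t + gain * c for t, c in zip(total, component)]
    return total

# Hypothetical example: two bands, pitch period of 40 samples (200 Hz at 8 kHz).
excitation = multiband_excitation(160, 40, [(0, 1000), (1000, 2000)], [1.0, 0.5])
```

In a full coder of the kind the abstract describes, this component would be added to the long-term predictive and codebook excitations before the short-term synthesis filter; those parts are not sketched here.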
Pages: 215-227
Number of pages: 13