Classified Comfort Noise Generation for Efficient Voice Transmission

被引:0
作者
Qian, Yasheng [1 ]
Hsu, Wei-Shou [1 ]
Kabal, Peter [1 ]
机构
[1] McGill Univ, Dept Elect & Comp Engn, Montreal, PQ H3A 2A7, Canada
来源
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5 | 2006年
关键词
Comfort Noise; Gaussian Mixture classifier; classified prototype codebook; enhanced classified excitation codebook; soft-decision Gaussian mixture classifier;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Comfort noise insertion during speech pause has been applied to Voice-over-IP and wireless networks for increasing bandwidth efficiency. We present two classified comfort noise generation (CCNG) schemes using Gaussian Mixture classifiers (GMM-C). Our first scheme employs a classified prototype background noise codebook with the prototype noise waveform chosen using a GMM-C. The second scheme utilizes a classified enhanced excitation codebook. The new CCNG algorithms provide better comfort noise during speech pauses and a smaller misclassification rate. We have retrofitted the scheme into existing speech transmission system, such as ITU-T 6.711/Appendix II and 6.723.1/Annex A. The perceived quality of a voice conversation of the novel system has been noticeably enhanced for car and babble noise. For the 6.711 system, a large improvement is obtained for car noise while the largest amelioration is for babble noise in the 6.723.1 system.
引用
收藏
页码:225 / 228
页数:4
相关论文
共 7 条
[1]  
*3GPP2, 1996, TIAIS127 3GPP2
[2]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[3]  
ELMALEH K, 2004, Patent No. 6782361
[4]  
GIERLICH HW, 2001, P INT WORKSH AC ECH
[5]  
*ITU T, 1996, G7231 ITUT
[6]  
*ITU T, 2000, G711 ITUT
[7]  
RUGGERI G, 2001, P IEEE INT C AC SPEE, V2