Compressed Domain Speech Enhancement based on the Joint Modification of Codebook Gains

被引:0
作者
Xia, Bing-yin [1 ]
Bao, Chang-chun [1 ]
Liang, Yan [1 ]
Zhou, Xuan [1 ]
He, Yu-wen [1 ]
Li, Ru-wei [1 ]
机构
[1] Beijing Univ Technol, Speech & Audio Signal Proc Lab, Sch Elect Informat & Control Engn, Beijing 100124, Peoples R China
来源
2011 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT) | 2011年
关键词
speech enhancement; compressed domain; CELP; codebook gain; joint modification;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
A compressed domain speech enhancement method based on the joint modification of adaptive and algebraic codebook gains for the codec of ITU-T G.722.2 is proposed in this paper. First the power of excitation signal corresponding to the noise is estimated by the method of minimum statistics. Then the decision-directed approach is used to get an estimate of the a priori SNR. And the algebraic codebook gain is modified by multiplying a Wiener-type modification factor. In order to solve the problem of power loss in voiced segment, the modified adaptive codebook gain is got by keeping the power of the modified excitation signal equal to the scaled version of the noisy one. The result of performance evaluation under ITU-T G.160 shows that, in comparison with the method that only modifies the algebraic codebook gain, the proposed method could provide larger amount of noise reduction in both white and colored noise with smaller attenuation on the speech level, and the objective speech quality is improved evidently.
引用
收藏
页码:207 / 211
页数:5
相关论文
共 8 条
[1]  
[Anonymous], 2005, VOIC ENH DEV MOB NET
[2]  
[Anonymous], 2001, ITU-T Rec. P. 862
[3]  
Chandran R, 2000, PROCEEDINGS OF THE 43RD IEEE MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS I-III, P10, DOI 10.1109/MWSCAS.2000.951575
[4]  
Duetsch N., 2004, 5 ITG FACHB JAN GERM, P357
[5]  
ITU-T, 2003, WID COD SPEECH AR 16
[6]   Noise power spectral density estimation based on optimal smoothing and minimum statistics [J].
Martin, R .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (05) :504-512
[7]  
Sukkar R.A., 2006, United States Patent Application, Patent No. [US 2006/0217970 Al, 20060217970]
[8]  
Taddei H, 2004, 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS, P497