Audio enhancement in compressed domain based on MPEG-AAC codec

被引：0

作者：

机构：

[1] School of Electronic Information and Control Engineering, Beijing University of Technology, Beijing

来源：

Deng, Feng | 1600年 / Chinese Institute of Electronics卷 / 42期

关键词：

AAC bit-stream; Audio enhancement; Compressed domain; COSH estimator; MWRA noise estimation;

D O I：

10.3969/j.issn.0372-2112.2014.07.026

中图分类号：

学科分类号：

摘要：

An audio enhancement method based on MPEG-AAC codec in compressed domain is proposed. First, the bit-stream derived from noisy audio signal is decoded to obtain MDCT coefficients of noisy audio signals. Then, the noise power is estimated by modified weighted recursive averaging (MWRA). Next, the adaptive β-order COSH statistic modal method is employed to enhance MDCT coefficients. Finally, the enhanced MDCT coefficients are re-quantized to obtain an enhanced bit-stream which is used to get the enhanced audio signals by AAC decoder. The test results indicate that the proposed algorithm can effectively remove the noises derived from AAC bit-stream of audio signals and obviously outperforms the reference noise reduction methods.

引用

页码：1410 / 1418

页数：8

共 19 条

[1] Boll S.F., Suppression of Acoustic Noise in Speech Using Spectral Subtraction , IEEE Transactions on Acoustics, Speech, & Signal Processing, 27, 2, pp. 113-120, (1979)
[2] Li J.F., Sakamoto S., Hongo S., Et al., Adaptive β-order Generalized Spectral Subtraction for Speech Enhancement , Signal Process, 88, 11, pp. 2764-2776, (2008)
[3] Miyazaki R., Saruwatari H., Inoue T., Et al., Musical-noise-free speechenhancement based on optimized iterative spectral subtraction , IEEE Transactions on Audio, Speech, and Language Processing, 20, 7, pp. 2080-2094, (2012)
[4] Loizou P., Speech enhancement based on perceptually motivated Bayesian estimators of the magnitude spectrum , IEEE Transactions on Speech Audio Process, 13, 5, pp. 857-869, (2005)
[5] You C.H., Et al., Masking-based β-order MMSE speech enhancement , Speech Communication, 48, 1, pp. 57-70, (2006)
[6] Li N., Bao C.C., Et al., Adaptive β-order WEDM spectral amplitude estimator for speech enhancement , IEEE International Symposium on Signal Processing and Information Technology (ISSPIT) , pp. 155-160, (2011)
[7] Donoho D.L., De-noising by Soft-thresholding , IEEE Transactions on Information Theory, 41, 3, pp. 613-627, (1995)
[8] Lin Y.T., Cai. J.L., A new threshold function for signal denoising based on wavelet transform , International Conference on Measuring Technology and Mechatronics Automation (ICMTMA) , pp. 200-203, (2010)
[9] Ghanbari Y., Karami-Mollaei M.R., A new approach for speech enhancement based on the adaptive thresholding of the wavelet packets , Speech Communication, 48, 8, pp. 927-940, (2006)
[10] Soon I.Y., Et al., Noisy speech enhancement using discrete cosine transform , Speech Communication, 24, 3, pp. 249-257, (1998)

← 1 2 →