A Robust Time-Frequency Decomposition Model for Suppression of Mixed Gaussian-Impulse Noise in Audio Signals

被引:6
作者
Tong, Renjie [1 ]
Zhou, Yingyue [1 ]
Zhang, Long [1 ]
Bao, Guangzhao [1 ]
Ye, Zhongfu [1 ]
机构
[1] Univ Sci & Technol China, Dept Elect Engn & Informat Sci, Hefei 230027, Peoples R China
关键词
Articulation-oriented; degraded; discriminative orthogonal matching pursuit (DOMP); Gaussian-impulse noise; restoration; robust time-frequency decomposition (RTFD); RESTRICTED ISOMETRY PROPERTY; SPEECH; SEPARATION; DICTIONARIES; ENHANCEMENT; RECOVERY; SYSTEMS;
D O I
10.1109/TASLP.2014.2371544
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we propose a robust time-frequency decomposition (RTFD) model to restore audio signals degraded by sparse impulse noise mixed with small dense Gaussian noise. This kind of noise is very common especially in old-time recordings. The proposed RTFD model is based on the observation that these degraded audio signals mainly contain four parts, i.e., the quasi-periodic and voiced part, the aperiodic and transient part, the arbitrarily large impulse noise and the small dense Gaussian noise. Sparsity and local correlations of corresponding parts are exploited to solve the RTFD model. We also heuristically develop a discriminative orthogonal matching pursuit (DOMP) algorithm to more precisely estimate sparse representing vectors. Specifically, the DOMP algorithm divides the whole atom set into two subsets, i.e., the active subset and the passive subset. Atoms in two subsets are treated discriminatively since sparsity regularization terms are not equally weighted. Based on RTFD and DOMP, we have developed two algorithms, i.e., the fidelity-oriented algorithm and the articulation-oriented algorithm. The proposed algorithms achieve considerable performance on both synthetic and real noisy signals. Results show that the articulation-oriented algorithm using DOMP obviously outperforms other algorithms in heavier impulse noise situations.
引用
收藏
页码:69 / 79
页数:11
相关论文
共 53 条
[1]   Audio Inpainting [J].
Adler, Amir ;
Emiya, Valentin ;
Jafari, Maria G. ;
Elad, Michael ;
Gribonval, Remi ;
Plumbley, Mark D. .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (03) :922-932
[2]  
Adler A, 2011, INT CONF ACOUST SPEE, P329
[3]   K-SVD: An algorithm for designing overcomplete dictionaries for sparse representation [J].
Aharon, Michal ;
Elad, Michael ;
Bruckstein, Alfred .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2006, 54 (11) :4311-4322
[4]  
[Anonymous], 2008, CS TECHNION
[5]  
Arce G.R., 2005, Nonlinear Signal Processing: A Statistical Approach
[6]   A Compressed Sensing Approach to Blind Separation of Speech Mixture Based on a Two-Layer Sparsity Model [J].
Bao, Guangzhao ;
Ye, Zhongfu ;
Xu, Xu ;
Zhou, Yingyue .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (05) :899-906
[7]   A Simple Proof of the Restricted Isometry Property for Random Matrices [J].
Baraniuk, Richard ;
Davenport, Mark ;
DeVore, Ronald ;
Wakin, Michael .
CONSTRUCTIVE APPROXIMATION, 2008, 28 (03) :253-263
[8]  
Barkat M., 2005, SIGNAL DETECTION EST
[9]  
Bishop Christopher, 2006, Pattern Recognition and Machine Learning, DOI 10.1117/1.2819119
[10]   Robust uncertainty principles:: Exact signal reconstruction from highly incomplete frequency information [J].
Candès, EJ ;
Romberg, J ;
Tao, T .
IEEE TRANSACTIONS ON INFORMATION THEORY, 2006, 52 (02) :489-509