Musical noise suppression using a low-rank and sparse matrix decomposition approach

Cited by: 3

Authors:

Sadasivan, Jishnu [1]

Dhiman, Jitendra K. [1]

Seelamantula, Chandra Sekhar [1]

Affiliations:

[1] Indian Inst Sci, Dept Elect Engn, Bangalore 560012, Karnataka, India

Source:

SPEECH COMMUNICATION | 2020 / Volume 125

Keywords:

Speech enhancement; Musical noise; Low-rank and sparse matrix decomposition; Robust PCA; CHANNEL SPEECH ENHANCEMENT; SPECTRAL SUBTRACTION; MASKING PROPERTIES; SUBSPACE APPROACH; RESIDUAL NOISE; REDUCTION;

DOI:

10.1016/j.specom.2020.09.001

Chinese Library Classification number:

O42 [Acoustics];

Discipline classification code:

070206 ; 082403 ;

Abstract:

We address the problem of suppressing musical noise from speech enhanced using a short-time processing algorithm. Enhancement algorithms rely on noise statistics and errors in estimating the statistics lead to residual noise in the enhanced signal. A frequently encountered residual noise type is the so-called musical noise, which is a consequence of spurious peaks occurring at random locations in the time-frequency (t-f) plane. Typically, speech enhancement algorithms operate on a short-time basis and perform attenuation of noisy speech spectral coefficients, effectively leading to a spectrotemporal gain function. We show that in case of speech distorted by musical noise, the spectrotemporal gain function has a distinct signature: the musical noise components are sparse in the t-f domain, whereas the spectrotemporal gain corresponding to the speech region exhibits a low-rank structure. Based on this observation, we propose a low-rank and sparse matrix decomposition of the spectrotemporal gain function. We show that musical noise can be effectively suppressed by reconstructing the speech signal using only the low-rank component. Performance comparison in terms of subjective scores and spectrographic analysis shows that the proposed technique is superior compared with two benchmark techniques. The proposed technique could be used in tandem with any speech enhancement algorithm that gives rise to musical noise.
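The decomposition described in the abstract can be illustrated with a minimal sketch. It assumes the principal component pursuit formulation of robust PCA (Candes et al., listed in the keywords as Robust PCA) solved by a fixed-step augmented Lagrangian iteration; the solver, the parameter defaults, the function names, and the synthetic gain matrix are assumptions made for illustration and are not taken from the paper.

```python
import numpy as np


def soft_threshold(x, tau):
    """Elementwise shrinkage: proximal operator of the l1 norm."""
    return np.sign(x) * np.maximum(np.abs(x) - tau, 0.0)


def svd_threshold(x, tau):
    """Singular-value shrinkage: proximal operator of the nuclear norm."""
    u, s, vt = np.linalg.svd(x, full_matrices=False)
    return (u * soft_threshold(s, tau)) @ vt


def rpca(gain, lam=None, mu=None, tol=1e-7, max_iter=500):
    """Split `gain` into low-rank + sparse parts (hypothetical helper).

    Minimises ||L||_* + lam * ||S||_1 subject to L + S = gain.
    The default lam and mu follow common choices for principal
    component pursuit; they are assumptions, not the paper's settings.
    """
    m, n = gain.shape
    if lam is None:
        lam = 1.0 / np.sqrt(max(m, n))
    if mu is None:
        mu = m * n / (4.0 * np.abs(gain).sum())
    low = np.zeros_like(gain)
    sparse = np.zeros_like(gain)
    dual = np.zeros_like(gain)
    norm_gain = np.linalg.norm(gain)
    for _ in range(max_iter):
        low = svd_threshold(gain - sparse + dual / mu, 1.0 / mu)
        sparse = soft_threshold(gain - low + dual / mu, lam / mu)
        residual = gain - low - sparse
        dual = dual + mu * residual
        if np.linalg.norm(residual) <= tol * norm_gain:
            break
    return low, sparse


# Synthetic stand-in for a spectrotemporal gain (frequency bins x frames):
# a rank-2 "speech" pattern plus isolated spikes playing the role of
# musical-noise peaks. Real gains would come from an enhancement algorithm.
rng = np.random.default_rng(0)
low_true = rng.random((64, 2)) @ rng.random((2, 80))
spikes = (rng.random((64, 80)) < 0.05) * rng.uniform(0.5, 1.5, (64, 80))
gain = low_true + spikes

low, sparse = rpca(gain)
print("relative error of low-rank estimate:",
      np.linalg.norm(low - low_true) / np.linalg.norm(low_true))
```

In the setting of the abstract, only the low-rank part would then be applied to the noisy short-time spectrum before resynthesis; that step depends on the enhancement front end and is not shown here.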

Pages: 41-52

Number of pages: 12
