Musical noise suppression using a low-rank and sparse matrix decomposition approach

被引:3
作者
Sadasivan, Jishnu [1 ]
Dhiman, Jitendra K. [1 ]
Seelamantula, Chandra Sekhar [1 ]
机构
[1] Indian Inst Sci, Dept Elect Engn, Bangalore 560012, Karnataka, India
关键词
Speech enhancement; Musical noise; Low-rank and sparse matrix decomposition; Robust PCA; CHANNEL SPEECH ENHANCEMENT; SPECTRAL SUBTRACTION; MASKING PROPERTIES; SUBSPACE APPROACH; RESIDUAL NOISE; REDUCTION;
D O I
10.1016/j.specom.2020.09.001
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We address the problem of suppressing musical noise from speech enhanced using a short-time processing algorithm. Enhancement algorithms rely on noise statistics and errors in estimating the statistics lead to residual noise in the enhanced signal. A frequently encountered residual noise type is the so-called musical noise, which is a consequence of spurious peaks occurring at random locations in the time-frequency (t-f) plane. Typically, speech enhancement algorithms operate on a short-time basis and perform attenuation of noisy speech spectral coefficients, effectively leading to a spectrotemporal gain function. We show that in case of speech distorted by musical noise, the spectrotemporal gain function has a distinct signature: the musical noise components are sparse in the t-f domain, whereas the spectrotemporal gain corresponding to the speech region exhibits a low-rank structure. Based on this observation, we propose a low-rank and sparse matrix decomposition of the spectrotemporal gain function. We show that musical noise can be effectively suppressed by reconstructing the speech signal using only the low-rank component. Performance comparison in terms of subjective scores and spectrographic analysis shows that the proposed technique is superior compared with two benchmark techniques. The proposed technique could be used in tandem with any speech enhancement algorithm that gives rise to musical noise.
引用
收藏
页码:41 / 52
页数:12
相关论文
共 67 条
  • [1] [Anonymous], 2001, ITU-T Rec. P. 862
  • [2] Araki S, 2005, INT CONF ACOUST SPEE, P81
  • [3] Speech Enhancement Based on Bayesian Low-Rank and Sparse Decomposition of Multichannel Magnitude Spectrograms
    Bando, Yoshiaki
    Itoyama, Katsutoshi
    Konyo, Masashi
    Tadokoro, Satoshi
    Nakadai, Kazuhiro
    Yoshii, Kazuyoshi
    Kawahara, Tatsuya
    Okuno, Hiroshi G.
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (02) : 215 - 230
  • [4] Berouti M., 1979, ICASSP 79. 1979 IEEE International Conference on Acoustics, Speech and Signal Processing, P208
  • [5] Cepstral smoothing of spectral filter gains for speech enhancement without musical noise
    Breithaupt, Colin
    Gerkmann, Timo
    Martin, Rainer
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2007, 14 (12) : 1036 - 1039
  • [6] Robust Principal Component Analysis?
    Candes, Emmanuel J.
    Li, Xiaodong
    Ma, Yi
    Wright, John
    [J]. JOURNAL OF THE ACM, 2011, 58 (03)
  • [7] Elimination of the Musical Noise Phenomenon with the Ephraim and Malah Noise Suppressor
    Cappe, Olivier
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (02): : 345 - 349
  • [8] New insights into the noise reduction Wiener filter
    Chen, Jingdong
    Benesty, Jacob
    Huang, Yiteng
    Doclo, Simon
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (04): : 1218 - 1234
  • [9] SPEECH ENHANCEMENT FROM NOISE - A REGENERATIVE APPROACH
    DENDRINOS, M
    BAKAMIDIS, S
    CARAYANNIS, G
    [J]. SPEECH COMMUNICATION, 1991, 10 (01) : 45 - 57
  • [10] A spectral filtering method based on hybrid wiener filters for speech enhancement
    Ding, Huijun
    Soon, Ing Yann
    Koh, Soo Nee
    Yeo, Chai Kiat
    [J]. SPEECH COMMUNICATION, 2009, 51 (03) : 259 - 267