UNSUPERVISED MONAURAL SPEECH ENHANCEMENT USING ROBUST NMF WITH LOW-RANK AND SPARSE CONSTRAINTS

Cited by: 0
Authors
Li, Yinan [1]
Zhang, Xiongwei [1]
Sun, Meng [1]
Min, Gang [1]
Affiliation
[1] PLA Univ Sci & Technol, Lab Intelligent Informat Proc, Nanjing, Jiangsu, Peoples R China
Source
2015 IEEE CHINA SUMMIT & INTERNATIONAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING | 2015
Keywords
speech enhancement; low-rank and sparse decomposition; non-negative matrix factorization;
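DOI
Not available
Chinese Library Classification (CLC) number
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];
Discipline classification code
0808 ; 0809 ;
Abstract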
Non-negative spectrogram decomposition and its variants have been extensively investigated for speech enhancement because they efficiently extract perceptually meaningful components from mixtures. These approaches usually assume that training samples for one or more sources are available beforehand. However, in many real-world scenarios, it is often impossible to conduct any prior training. To solve this problem, we propose an approach that extracts representations of the background noise directly from the noisy speech by imposing non-negative constraints on the low-rank and sparse decomposition of the noisy spectrogram. The noise representations are subsequently used to estimate the clean speech. This technique can discover potential structural regularity in the spectra, allowing better reconstruction of the clean speech. Evaluations on the Noisex-92 and TIMIT databases showed that the proposed method achieves significant improvements over state-of-the-art methods in unsupervised speech enhancement.
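The abstract does not state the objective function or update rules, so the following is only a minimal sketch of one common way to impose non-negativity on a low-rank plus sparse decomposition, not the authors' algorithm. It assumes the magnitude spectrogram V is modeled as V ≈ WH + S, with the low-rank part WH standing in for the background noise and the sparse part S for the speech, and it minimizes 0.5·||V − WH − S||² + lam·||S||₁ under W, H, S ≥ 0. The function name `robust_nmf` and the parameters `rank`, `lam`, and `n_iter` are hypothetical, as is the synthetic test matrix.

```python
import numpy as np


def robust_nmf(V, rank=2, lam=0.5, n_iter=200, eps=1e-9, seed=0):
    """Sketch: V ~ W @ H + S with W, H, S >= 0 (assumed formulation).

    W @ H is a low-rank term (assumed here to model the noise) and
    S is a sparse term (assumed here to model the speech).
    """
    rng = np.random.default_rng(seed)
    n_freq, n_frames = V.shape
    W = rng.random((n_freq, rank)) + eps
    H = rng.random((rank, n_frames)) + eps
    S = np.zeros_like(V)
    for _ in range(n_iter):
        # Residual left for the low-rank term; clipped for numerical safety.
        R = np.maximum(V - S, 0.0)
        # Multiplicative updates for the Euclidean NMF cost keep W and H >= 0.
        H *= (W.T @ R) / (W.T @ W @ H + eps)
        W *= (R @ H.T) / (W @ (H @ H.T) + eps)
        # Non-negative soft threshold: exact minimizer over S for fixed W, H.
        S = np.maximum(V - W @ H - lam, 0.0)
    return W, H, S


# Synthetic stand-in for a magnitude spectrogram (not real speech data):
# a rank-2 non-negative background plus a few large sparse entries.
rng = np.random.default_rng(1)
low_rank = rng.random((64, 2)) @ rng.random((2, 100))
spikes = (rng.random((64, 100)) < 0.05) * 5.0
V = low_rank + spikes

W, H, S = robust_nmf(V, rank=2, lam=0.5)
print("fraction of non-zero entries in S:", np.mean(S > 0))
```

In this sketch the sparse term S could serve as a rough magnitude estimate of the speech; how the paper uses the learned noise representations to reconstruct the clean speech is not described in the abstract and is not reproduced here.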
Pages: 1-4
Number of pages: 4
Related papers
50 in total
[21]   Speech Enhancement Based on Dictionary Learning and Low-Rank Matrix Decomposition [J].
Ji, Yunyun ;
Zhu, Wei-Ping ;
Champagne, Benoit .
IEEE ACCESS, 2019, 7 :4936-4947
[22]   Wavelet-Based Weighted Low-Rank Sparse Decomposition Model for Speech Enhancement Using Gammatone Filter Bank Under Low SNR Conditions [J].
Sridhar, K. Venkata ;
Kumar, T. Kishore .
FLUCTUATION AND NOISE LETTERS, 2023, 22 (02)
[23]   Unsupervised Speech Enhancement Based on Multichannel NMF-Informed Beamforming for Noise-Robust Automatic Speech Recognition [J].
Shimada, Kazuki ;
Bando, Yoshiaki ;
Mimura, Masato ;
Itoyama, Katsutoshi ;
Yoshii, Kazuyoshi ;
Kawahara, Tatsuya .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (05) :960-971
[24]   MOTION SALIENCY DETECTION USING LOW-RANK AND SPARSE DECOMPOSITION [J].
Xue, Yawen ;
Guo, Xiaojie ;
Cao, Xiaochun .
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, :1485-1488
[25]   Speech Enhancement via Low-rank Matrix Decomposition and Image Based Masking [J].
Liu, Liyang ;
Ding, Zhaogui ;
Li, Weifeng ;
Wang, Longbiao ;
Liao, Qingmin .
2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, :389-+
[26]   NONNEGATIVE LOW-RANK SPARSE COMPONENT ANALYSIS [J].
Cohen, Jeremy E. ;
Gillis, Nicolas .
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, :8226-8230
[27]   Robust Localization in Wireless Sensor Networks via Low-rank and Sparse Matrix Decomposition [J].
Rossi, Beatrice ;
Patane, Marco ;
Fragneto, Pasqualina ;
Fusiello, Andrea .
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON FUTURE NETWORKS AND DISTRIBUTED SYSTEMS (ICFNDS '17), 2017,
[28]   Double Adversarial Network based Monaural Speech Enhancement for Robust Speech Recognition [J].
Du, Zhihao ;
Han, Jiqing ;
Zhang, Xueliang .
INTERSPEECH 2020, 2020, :309-313
[29]   SPEECH ENHANCEMENT USING β-DIVERGENCE BASED NMF WITH UPDATE BASES [J].
Sunnydayal, V. ;
Kumar, T. Kishore .
2016 INTERNATIONAL CONFERENCE ON MICROELECTRONICS, COMPUTING AND COMMUNICATIONS (MICROCOM), 2016,
[30]   Multichannel Linear Prediction-Based Speech Dereverberation Considering Sparse and Low-Rank Priors [J].
Wang, Taihui ;
Yang, Feiran ;
Yang, Jun .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 :1724-1735