Local Sparsity Based Online Dictionary Learning for Environment-Adaptive Speech Enhancement with Nonnegative Matrix Factorization

Cited by: 7
Authors
Jeon, Kwang Myung [1]
Kim, Hong Kook [1]
Affiliation
[1] GIST, Sch Elect Engn & Comp Sci, Gwangju 61005, South Korea
Source
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES | 2016
Funding
National Research Foundation of Singapore;
Keywords
speech enhancement; diverse noise; environment adaptation; nonnegative matrix factorization; online dictionary learning; local sparsity; NOISE;
DOI
10.21437/Interspeech.2016-586
Chinese Library Classification (CLC) number
O42 [Acoustics];
Subject classification codes
070206 ; 082403 ;
Abstract
In this paper, a nonnegative matrix factorization (NMF)-based speech enhancement method robust to real and diverse noise is proposed by online NMF dictionary learning without relying on prior knowledge of noise. Conventional NMF-based methods have used a fixed noise dictionary, which often results in performance degradation when the NMF noise dictionary cannot cover the noise types that occur in real-life recordings. Thus, the noise dictionary needs to be learned from noises according to the variation of recording environments. To this end, the proposed method first estimates noise spectra and then performs online noise dictionary learning by a discriminative NMF learning framework. In particular, the noise spectra are estimated from minimum mean squared error filtering, which is based on the local sparsity defined by the a posteriori signal-to-noise ratio (SNR) estimated from the NMF separation of the previous analysis frame. The effectiveness of the proposed speech enhancement method is demonstrated by adding six different realistic noises to clean speech signals at various SNRs. Consequently, it is shown that the proposed method outperforms comparative methods in terms of signal-to-distortion ratio (SDR) and perceptual evaluation of speech quality (PESQ) for all kinds of simulated noise and SNR conditions.
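The general idea in the abstract (separate speech from noise with NMF, then refresh the noise dictionary as the environment changes) can be illustrated with a minimal sketch. This is not the authors' method: it uses plain KL-divergence multiplicative updates instead of the discriminative NMF framework, and it takes the noise estimate as a given input instead of deriving it from the local-sparsity-based MMSE filter. All function names, parameters, and matrix sizes below are hypothetical.

```python
import numpy as np

EPS = 1e-12  # guards against division by zero


def kl_activations(V, W, n_iter=50, seed=0):
    """Estimate nonnegative activations H with V ~= W @ H, keeping W fixed.

    Uses the standard multiplicative update for the KL divergence.
    """
    rng = np.random.default_rng(seed)
    H = rng.random((W.shape[1], V.shape[1])) + EPS
    ones = np.ones_like(V)
    for _ in range(n_iter):
        H *= (W.T @ (V / (W @ H + EPS))) / (W.T @ ones + EPS)
    return H


def update_noise_dictionary(V_noise, W_noise, n_iter=30, seed=0):
    """Refine a noise dictionary on an estimated noise spectrogram.

    Assumption: V_noise is supplied by some external noise estimator.
    """
    rng = np.random.default_rng(seed)
    W = W_noise.copy()
    H = rng.random((W.shape[1], V_noise.shape[1])) + EPS
    ones = np.ones_like(V_noise)
    for _ in range(n_iter):
        # Alternate the KL multiplicative updates for H and W.
        H *= (W.T @ (V_noise / (W @ H + EPS))) / (W.T @ ones + EPS)
        W *= ((V_noise / (W @ H + EPS)) @ H.T) / (ones @ H.T + EPS)
    return W


def enhance(V, W_speech, W_noise, n_iter=50):
    """Apply a Wiener-like soft mask built from the NMF speech/noise estimates."""
    W = np.hstack([W_speech, W_noise])
    H = kl_activations(V, W, n_iter=n_iter)
    k = W_speech.shape[1]
    S = W_speech @ H[:k]   # estimated speech magnitude
    N = W_noise @ H[k:]    # estimated noise magnitude
    mask = S / (S + N + EPS)
    return mask * V
```

In use, `V` would be a nonnegative magnitude spectrogram (frequency bins by frames), `W_speech` a dictionary trained offline on clean speech, and `W_noise` a dictionary that `update_noise_dictionary` refreshes whenever a new noise estimate becomes available.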
Pages: 2861 - 2865
Number of pages: 5
Related papers
50 in total
  • [1] Wavelet Speech Enhancement Based on Nonnegative Matrix Factorization
    Wang, Syu-Siang
    Chern, Alan
    Tsao, Yu
    Hung, Jeih-weih
    Lu, Xugang
    Lai, Ying-Hui
    Su, Borching
    IEEE SIGNAL PROCESSING LETTERS, 2016, 23 (08) : 1101 - 1105
  • [2] Speech Enhancement Based on Codebook Constrained Nonnegative Matrix Factorization
    Bai, Zhigang
    Bao, Changchun
    Yan, Bofang
    2018 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP), 2018, : 361 - 365
  • [3] Regularized nonnegative matrix factorization with adaptive local structure learning
    Huang, Shudong
    Xu, Zenglin
    Kang, Zhao
    Ren, Yazhou
    NEUROCOMPUTING, 2020, 382 : 196 - 209
  • [4] Constrained nonnegative matrix factorization based on local learning
    Shu, Zhenqiu
    Zhao, Chunxia
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2015, 43 (07) : 82 - 86
  • [5] Adaptive local learning regularized nonnegative matrix factorization for data clustering
    Sheng, Yongpan
    Wang, Meng
    Wu, Tianxing
    Xu, Han
    APPLIED INTELLIGENCE, 2019, 49 (06) : 2151 - 2168
  • [6] Adaptive local learning regularized nonnegative matrix factorization for data clustering
    Yongpan Sheng
    Meng Wang
    Tianxing Wu
    Han Xu
    Applied Intelligence, 2019, 49 : 2151 - 2168
  • [7] SPEECH ENHANCEMENT USING SEGMENTAL NONNEGATIVE MATRIX FACTORIZATION
    Fan, Hao-Teng
    Hung, Jeih-weih
    Lu, Xugang
    Wang, Syu-Siang
    Tsao, Yu
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [8] Nonnegative matrix factorization with region sparsity learning for hyperspectral unmixing
    Qian, Bin
    Tong, Lei
    Tang, Zhenmin
    Shen, Xiaobo
    INTERNATIONAL JOURNAL OF WAVELETS MULTIRESOLUTION AND INFORMATION PROCESSING, 2017, 15 (06)
  • [9] Dictionary Learning by Nonnegative Matrix Factorization with l1/2-Norm Sparsity Constraint
    Li, Zhenni
    Tang, Zunyi
    Ding, Shuxue
    2013 IEEE INTERNATIONAL CONFERENCE ON CYBERNETICS (CYBCONF), 2013,
  • [10] Local Learning Regularized Nonnegative Matrix Factorization
    Gu, Quanquan
    Zhou, Jie
    21ST INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-09), PROCEEDINGS, 2009, : 1046 - 1051