Local Sparsity Based Online Dictionary Learning for Environment-Adaptive Speech Enhancement with Nonnegative Matrix Factorization

被引:7
|
作者
Jeon, Kwang Myung [1 ]
Kim, Hong Kook [1 ]
机构
[1] GIST, Sch Elect Engn & Comp Sci, Gwangju 61005, South Korea
来源
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES | 2016年
基金
新加坡国家研究基金会;
关键词
speech enhancement; diverse noise; environment adaptation; nonnegative matrix factorization; online dictionary learning; local sparsity; NOISE;
D O I
10.21437/Interspeech.2016-586
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, a nonnegative matrix factorization (NMF)-based speech enhancement method robust to real and diverse noise is proposed by online NMF dictionary learning without relying on prior knowledge of noise. Conventional NMF-based methods have used a fixed noise dictionary, which often results in performance degradation when the NMF noise dictionary cannot cover noise types that occur in real-life recording. Thus, the noise dictionary needs to be learned from noises according to the variation of recording environments. To this end, the proposed method first estimates noise spectra and then performs online noise dictionary learning by a discriminative NMF learning framework. In particular, the noise spectra are estimated from minimum mean squared error filtering, which is based on the local sparsity defined by a posteriori signal-to-noise ratio (SNR) estimated from the NMF separation of the previous analysis frame. The effectiveness of the proposed speech enhancement method is demonstrated by adding six different realistic noises to clean speech signals with various SNRs Consequently, it is shown that the proposed method outperforms comparative methods in terms of signal-to-distortion ratio (SDR) and perceptual evaluation of speech quality (PESQ) for all kinds of simulated noise and SNR conditions.
引用
收藏
页码:2861 / 2865
页数:5
相关论文
共 50 条
  • [31] SPARSITY LEVEL IN A NON-NEGATIVE MATRIX FACTORIZATION BASED SPEECH STRATEGY IN COCHLEAR IMPLANTS
    Hu, Hongmei
    Mohammadiha, Nasser
    Taghia, Jalil
    Leijon, Arne
    Lutman, Mark E.
    Wang, Shouyan
    2012 PROCEEDINGS OF THE 20TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2012, : 2432 - 2436
  • [32] SPECTRAL UNMIXING BASED ON NONNEGATIVE MATRIX FACTORIZATION WITH LOCAL SMOOTHNESS CONSTRAINT
    Yang, Zuyuan
    Yang, Liu
    Cai, Zhaoquan
    Xiang, Yong
    2015 IEEE CHINA SUMMIT & INTERNATIONAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING, 2015, : 635 - 638
  • [33] UNSUPERVISED BEAMFORMING BASED ON MULTICHANNEL NONNEGATIVE MATRIX FACTORIZATION FOR NOISY SPEECH RECOGNITION
    Shimada, Kazuki
    Bando, Yoshiaki
    Mimura, Masato
    Itoyama, Katsutoshi
    Yoshii, Kazuyoshi
    Kawahara, Tatsuya
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5734 - 5738
  • [34] Non-negative Matrix Factorization Speech Enhancement Method Based on Constraints of Temporal Continuity
    Zou, Qiang
    Sun, Chengli
    Yuan, Conglin
    Sun, Yifan
    PROCEEDINGS OF 2019 IEEE 3RD INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC 2019), 2019, : 542 - 546
  • [35] INCREMENTAL LEARNING BASED ON BLOCK SPARSE KERNEL NONNEGATIVE MATRIX FACTORIZATION
    Chen, Wen-Sheng
    Li, Yugao
    Pan, Binbin
    Chen, Bo
    PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON WAVELET ANALYSIS AND PATTERN RECOGNITION (ICWAPR), 2016, : 219 - 224
  • [36] Transfer Learning via Feature Selection Based Nonnegative Matrix Factorization
    Balasubramaniam, Thirunavukarasu
    Nayak, Richi
    Yuen, Chau
    WEB INFORMATION SYSTEMS ENGINEERING - WISE 2019, 2019, 11881 : 82 - 97
  • [37] Adaptive graph-based discriminative nonnegative matrix factorization for image clustering
    Zhang, Ying
    Li, Xiangli
    Jia, Mengxue
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2021, 95
  • [38] Semi-Supervised Speech Enhancement Combining Nonnegative Matrix Factorization and Robust Principal Component Analysis
    Hu, Yonggang
    Zhang, Xiongwei
    Zou, Xia
    Sun, Meng
    Zheng, Yunfei
    Min, Gang
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2017, E100A (08) : 1714 - 1719
  • [39] Global and local similarity learning in multi-kernel space for nonnegative matrix factorization
    Peng, Chong
    Hou, Xingrong
    Chen, Yongyong
    Kang, Zhao
    Chen, Chenglizhao
    Cheng, Qiang
    KNOWLEDGE-BASED SYSTEMS, 2023, 279
  • [40] Alpha-Stable Autoregressive Fast Multichannel Nonnegative Matrix Factorization for Joint Speech Enhancement and Dereverberation
    Fontaine, Mathieu
    Sekiguchi, Kouhei
    Nugraha, Aditya Arie
    Bando, Yoshiaki
    Yoshii, Kazuyoshi
    INTERSPEECH 2021, 2021, : 661 - 665