Noise Suppression based on nonnegative matrix factorization for robust speech recognition

Cited by: 0
Authors
Fan, Hao-teng [1 ]
Lin, Pao-han [1 ]
Hung, Jeih-weih [1 ]
Affiliation
[1] Natl Chi Nan Univ, Dept Elect Engn, Puli, Taiwan
Source
2014 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE, ELECTRONICS AND ELECTRICAL ENGINEERING (ISEEE), VOLS 1-3 | 2014
Keywords
nonnegative matrix factorization; noise suppression; speech recognition; noise-robustness;
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
This paper presents a novel noise-robustness method, nonnegative matrix factorization-based noise suppression (NNS), which enhances the magnitude spectrum of speech signals to improve speech recognition performance in noise-corrupted environments. In the presented approach, the clean data and noise in the training set are first converted to spectrograms via the short-time Fourier transform (STFT), and the basis spectral matrices of the speech data and noise are learned from the corresponding spectrograms. Then, the magnitude spectrogram of the noise-corrupted testing data is factorized via the basis matrices of the clean data, and the resulting noise components are removed from the original magnitude spectrogram. Finally, the new noise-reduced magnitude spectrogram is combined with the original noisy phase spectrogram and converted back to a time-domain signal, which is subsequently converted to a sequence of MFCC speech features. When the presented NNS is used as a pre-processing stage of the speech recognition system, the obtained recognition accuracy outperforms the MFCC baseline, especially at medium and low SNRs. Furthermore, performing NNS on different sub-band spectrograms further improves the recognition results relative to the original NNS performed on the full-band spectrogram, indicating that sub-band NNS can produce more robust speech features suitable for noisy speech recognition.
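The processing chain described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the Euclidean multiplicative-update rule, the use of a concatenated speech-plus-noise basis at test time, the floor-at-zero subtraction, and the names `nmf` and `suppress_noise` are all assumptions made for illustration. The STFT, inverse STFT, and MFCC extraction steps are left out; the input is assumed to be an already computed complex spectrogram.

```python
import numpy as np

EPS = 1e-12  # guards against division by zero in the updates


def nmf(V, rank, n_iter=200, W=None, seed=0):
    """Approximate V by W @ H with nonnegative factors.

    Uses Euclidean-distance multiplicative updates (an assumed choice).
    If W is supplied it is kept fixed and only H is updated.
    """
    rng = np.random.default_rng(seed)
    learn_W = W is None
    if learn_W:
        W = rng.random((V.shape[0], rank)) + EPS
    H = rng.random((W.shape[1], V.shape[1])) + EPS
    for _ in range(n_iter):
        H *= (W.T @ V) / (W.T @ W @ H + EPS)
        if learn_W:
            W *= (V @ H.T) / (W @ H @ H.T + EPS)
    return W, H


def suppress_noise(noisy_stft, W_speech, W_noise, n_iter=200):
    """Return a complex spectrogram with the estimated noise magnitude removed."""
    mag = np.abs(noisy_stft)
    phase = np.angle(noisy_stft)

    # Assumption: factorize the noisy magnitude over fixed, pre-trained bases.
    W = np.hstack([W_speech, W_noise])
    _, H = nmf(mag, W.shape[1], n_iter=n_iter, W=W)

    # Noise estimate: the part explained by the noise basis vectors.
    noise_est = W_noise @ H[W_speech.shape[1]:]

    # Assumption: plain subtraction, floored at zero to stay nonnegative.
    enhanced_mag = np.maximum(mag - noise_est, 0.0)

    # Reuse the original noisy phase, as the abstract describes.
    return enhanced_mag * np.exp(1j * phase)
```

In use, `W_speech` and `W_noise` would come from calling `nmf` on the magnitude spectrograms of the clean and noise training data, and the output of `suppress_noise` would be passed to an inverse STFT before feature extraction. A sub-band variant would apply the same steps to separate frequency ranges of the spectrogram.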
Pages: 1731 / +
Number of pages: 2
Related papers
50 in total
  • [21] Face Recognition Using Region-Based Nonnegative Matrix Factorization
    Byeon, Wonmin
    Jeon, Moongu
    COMMUNICATION AND NETWORKING, 2009, 56 : 621 - 628
  • [22] SPEECH ENHANCEMENT USING SEGMENTAL NONNEGATIVE MATRIX FACTORIZATION
    Fan, Hao-Teng
    Hung, Jeih-weih
    Lu, Xugang
    Wang, Syu-Siang
    Tsao, Yu
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [23] Class-Cone based Nonnegative Matrix Factorization for Face Recognition
    Li, Yang
    Chen, Wen-Sheng
    Pan, Binbin
    Chen, Bo
    2018 14TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY (CIS), 2018, : 120 - 124
  • [24] Deep Transductive Nonnegative Matrix Factorization for Speech Separation
    Liu, Yalin
    Guan, Naiyang
    Liu, Jie
    2017 16TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2017, : 249 - 254
  • [25] Robust graph regularized nonnegative matrix factorization for clustering
    Huang, Shudong
    Wang, Hongjun
    Li, Tao
    Li, Tianrui
    Xu, Zenglin
    DATA MINING AND KNOWLEDGE DISCOVERY, 2018, 32 (02) : 483 - 503
  • [26] CONSTRAINED NONNEGATIVE MATRIX FACTORIZATION FOR ROBUST HYPERSPECTRAL UNMIXING
    Feng, Fan
    Deng, Chenwei
    Wang, Wenzheng
    Dai, Jiahui
    Li, Zhenzhen
    Zhao, Baojun
    IGARSS 2018 - 2018 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2018, : 4221 - 4224
  • [27] Robust graph regularized nonnegative matrix factorization for clustering
    Shudong Huang
    Hongjun Wang
    Tao Li
    Tianrui Li
    Zenglin Xu
    Data Mining and Knowledge Discovery, 2018, 32 : 483 - 503
  • [28] Research on Speech Enhancement Based on Nonnegative Matrix Factorization and Improved Genetic Algorithm
    Wang Wenqi
    Zhang Hongjin
    Fu Shan
    PROCEEDINGS OF THE 35TH CHINESE CONTROL CONFERENCE 2016, 2016, : 4950 - 4954
  • [29] Exemplar-Based Sparse Representations for Noise Robust Automatic Speech Recognition
    Gemmeke, Jort F.
    Virtanen, Tuomas
    Hurmalainen, Antti
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (07) : 2067 - 2080
  • [30] TRANSIENT NOISE REDUCTION USING NONNEGATIVE MATRIX FACTORIZATION
    Mohammadiha, Nasser
    Doclo, Simon
    2014 4TH JOINT WORKSHOP ON HANDS-FREE SPEECH COMMUNICATION AND MICROPHONE ARRAYS (HSCMA), 2014, : 27 - 31