Noise Suppression based on nonnegative matrix factorization for robust speech recognition

Citations: 0
Authors
Fan, Hao-teng [1 ]
Lin, Pao-han [1 ]
Hung, Jeih-weih [1 ]
Affiliation
[1] Natl Chi Nan Univ, Dept Elect Engn, Puli, Taiwan
Source
2014 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE, ELECTRONICS AND ELECTRICAL ENGINEERING (ISEEE), VOLS 1-3 | 2014
Keywords
nonnegative matrix factorization; noise suppression; speech recognition; noise-robustness;
DOI
Not available
Chinese Library Classification
TP [Automation Technology, Computer Technology];
Discipline classification code
0812 ;
Abstract
This paper presents a novel noise robustness method, nonnegative matrix factorization-based noise suppression (NNS), to enhance the magnitude spectrum of speech signals for better speech recognition performance in noise-corrupted environments. In the presented approach, the clean data and noise in the training set are first converted to spectrograms via the short-time Fourier transform (STFT), and the basis spectral matrices of the speech data and noise are learned from the corresponding spectrograms. Then, the magnitude spectrogram of the noise-corrupted testing data is factorized via the basis matrices of the clean data, and the resulting noise components are removed from the original magnitude spectrogram. Finally, the new noise-reduced magnitude spectrogram is combined with the original noisy phase spectrogram and converted back to a time-domain signal, which is subsequently converted to a sequence of MFCC speech features. When the presented NNS is used as a pre-processing stage of the speech recognition system, the obtained recognition accuracy can exceed that of the MFCC baseline, especially at medium and low SNRs. Furthermore, applying NNS to separate sub-band spectrograms can further improve the recognition results relative to the original NNS applied to the full-band spectrogram, indicating that sub-band NNS can produce more robust speech features suitable for noisy speech recognition.
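The spectrogram-domain steps described in the abstract can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the abstract does not specify the NMF cost function or update rule, so the sketch assumes the Euclidean multiplicative updates of Lee and Seung, and it assumes the noisy spectrogram is encoded on the concatenated speech and noise bases. The function names (`learn_basis`, `encode`, `suppress_noise`) are hypothetical, and the STFT, inverse STFT, and MFCC extraction are left to the caller.

```python
import numpy as np

EPS = 1e-12  # guards against division by zero in the updates


def learn_basis(V, rank, n_iter=200, seed=0):
    """Learn a nonnegative basis W (freq x rank) such that V ~= W @ H.

    Assumption: Euclidean-cost multiplicative updates.
    """
    rng = np.random.default_rng(seed)
    W = rng.random((V.shape[0], rank)) + EPS
    H = rng.random((rank, V.shape[1])) + EPS
    for _ in range(n_iter):
        H *= (W.T @ V) / (W.T @ W @ H + EPS)
        W *= (V @ H.T) / (W @ H @ H.T + EPS)
    return W


def encode(V, W, n_iter=200, seed=0):
    """Estimate nonnegative activations H for a fixed basis W."""
    rng = np.random.default_rng(seed)
    H = rng.random((W.shape[1], V.shape[1])) + EPS
    for _ in range(n_iter):
        H *= (W.T @ V) / (W.T @ W @ H + EPS)
    return H


def suppress_noise(noisy_spec, W_speech, W_noise):
    """Return a complex spectrogram with a reduced magnitude and the noisy phase."""
    mag = np.abs(noisy_spec)
    phase = np.angle(noisy_spec)
    # Assumption: factorize the noisy magnitude on both bases at once.
    W = np.hstack([W_speech, W_noise])
    H = encode(mag, W)
    k = W_speech.shape[1]
    noise_est = W_noise @ H[k:]              # estimated noise magnitude
    clean_mag = np.maximum(mag - noise_est, 0.0)  # keep magnitudes nonnegative
    return clean_mag * np.exp(1j * phase)    # reuse the original noisy phase
```

In use, `learn_basis` would be called once on the clean-speech training magnitude spectrogram and once on the noise magnitude spectrogram; `suppress_noise` would then be applied to the complex STFT of each test utterance before the inverse STFT. A sub-band variant could apply the same functions to separate ranges of frequency rows, though the sub-band split used in the paper is not stated here.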
Pages: 1731 / +
Page count: 2
Related papers
50 in total
[41]   Semi-Supervised Speech Enhancement Combining Nonnegative Matrix Factorization and Robust Principal Component Analysis [J].
Hu, Yonggang ;
Zhang, Xiongwei ;
Zou, Xia ;
Sun, Meng ;
Zheng, Yunfei ;
Min, Gang .
IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2017, E100A (08) :1714-1719
[42]   AMPLITUDE-BASED SPEECH ENHANCEMENT WITH NONNEGATIVE MATRIX FACTORIZATION FOR ASYNCHRONOUS DISTRIBUTED RECORDING [J].
Chiba, Hironobu ;
Ono, Nobutaka ;
Miyabe, Shigeki ;
Takahashi, Yu ;
Yamada, Takeshi ;
Makino, Shoji .
2014 14TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2014, :203-207
[43]   Distributionally Robust and Multi-Objective Nonnegative Matrix Factorization [J].
Gillis, Nicolas ;
Le Thi Khanh Hien ;
Leplat, Valentin ;
Tan, Vincent Y. F. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (08) :4052-4064
[44]   Fast and Robust Recursive Algorithms for Separable Nonnegative Matrix Factorization [J].
Gillis, Nicolas ;
Vavasis, Stephen A. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (04) :698-714
[45]   Reconstruction of reflectance spectra using robust nonnegative matrix factorization [J].
Ben Hamza, A. ;
Brady, David J. .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2006, 54 (09) :3637-3642
[46]   LUNG SEGMENTATION BASED ON NONNEGATIVE MATRIX FACTORIZATION [J].
Hosseini-Asl, Ehsan ;
Zurada, Jacek M. ;
El-Baz, Ayman .
2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, :877-881
[47]   LOCALITY PRESERVING NONNEGATIVE MATRIX FACTORIZATION WITH APPLICATION TO FACE RECOGNITION [J].
Zhang, Taiping ;
Fang, Bin ;
Tang, Yuan Y. ;
Shang, Zhaowei .
INTERNATIONAL JOURNAL OF WAVELETS MULTIRESOLUTION AND INFORMATION PROCESSING, 2010, 8 (05) :835-846
[48]   NON-NEGATIVE MATRIX DECONVOLUTION IN NOISE ROBUST SPEECH RECOGNITION [J].
Hurmalainen, Antti ;
Gemmeke, Jort ;
Virtanen, Tuomas .
2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, :4588-4591
[49]   An Application Specific Matrix Processor for Signal subspace based speech enhancement in noise robust speech recognition applications [J].
Natarajan, Karthikeyan ;
Arun, S. ;
Murugaraj, K. ;
John, Mala .
ASICON 2007: 2007 7TH INTERNATIONAL CONFERENCE ON ASIC, VOLS 1 AND 2, PROCEEDINGS, 2007, :766-769
[50]   Manifold Adaptive Kernel Nonnegative Matrix Factorization for Face Recognition [J].
Sun, Xia ;
Wang, Ziqiang ;
Sun, Lijun .
JOURNAL OF COMPUTATIONAL AND THEORETICAL NANOSCIENCE, 2015, 12 (09) :2710-2719