Noise-robust speech feature processing with empirical mode decomposition

被引:0
作者
Kuo-Hau Wu
Chia-Ping Chen
Bing-Feng Yeh
机构
[1] National Sun Yat-Sen University,Department of Computer Science and Engineering
来源
EURASIP Journal on Audio, Speech, and Music Processing | / 2011卷
关键词
Speech Signal; Empirical Mode Decomposition; Automatic Speech Recognition; Intrinsic Mode Function; Lower Envelope;
D O I
暂无
中图分类号
学科分类号
摘要
In this article, a novel technique based on the empirical mode decomposition methodology for processing speech features is proposed and investigated. The empirical mode decomposition generalizes the Fourier analysis. It decomposes a signal as the sum of intrinsic mode functions. In this study, we implement an iterative algorithm to find the intrinsic mode functions for any given signal. We design a novel speech feature post-processing method based on the extracted intrinsic mode functions to achieve noise-robustness for automatic speech recognition. Evaluation results on the noisy-digit Aurora 2.0 database show that our method leads to significant performance improvement. The relative improvement over the baseline features increases from 24.0 to 41.1% when the proposed post-processing method is applied on mean-variance normalized speech features. The proposed method also improves over the performance achieved by a very noise-robust frontend when the test speech data are highly mismatched.
引用
收藏
相关论文
共 50 条