Noise-robust speech feature processing with empirical mode decomposition

被引:0
|
作者
Kuo-Hau Wu
Chia-Ping Chen
Bing-Feng Yeh
机构
[1] National Sun Yat-Sen University,Department of Computer Science and Engineering
来源
EURASIP Journal on Audio, Speech, and Music Processing | / 2011卷
关键词
Speech Signal; Empirical Mode Decomposition; Automatic Speech Recognition; Intrinsic Mode Function; Lower Envelope;
D O I
暂无
中图分类号
学科分类号
摘要
In this article, a novel technique based on the empirical mode decomposition methodology for processing speech features is proposed and investigated. The empirical mode decomposition generalizes the Fourier analysis. It decomposes a signal as the sum of intrinsic mode functions. In this study, we implement an iterative algorithm to find the intrinsic mode functions for any given signal. We design a novel speech feature post-processing method based on the extracted intrinsic mode functions to achieve noise-robustness for automatic speech recognition. Evaluation results on the noisy-digit Aurora 2.0 database show that our method leads to significant performance improvement. The relative improvement over the baseline features increases from 24.0 to 41.1% when the proposed post-processing method is applied on mean-variance normalized speech features. The proposed method also improves over the performance achieved by a very noise-robust frontend when the test speech data are highly mismatched.
引用
收藏
相关论文
共 50 条
  • [21] Robust Voiced/Unvoiced Speech Classification using Empirical Mode Decomposition and Periodic Correlation Model
    Molla, Md. Khademul Islam
    Hirose, Keikichi
    Minematsu, Nobuaki
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 2530 - +
  • [22] Improved Empirical Mode Decomposition Using Optimal Recursive Averaging Noise Estimation for Speech Enhancement
    Asma Bouchair
    Sid Ahmed Selouani
    Abderrahmane Amrouche
    Mohammed Sidi Yakoub
    Circuits, Systems, and Signal Processing, 2022, 41 : 196 - 223
  • [23] Unsupervised learning of time-frequency patches as a noise-robust representation of speech
    Van Segbroeck, Maarten
    Van Hamme, Hugo
    SPEECH COMMUNICATION, 2009, 51 (11) : 1124 - 1138
  • [24] Improved Empirical Mode Decomposition Using Optimal Recursive Averaging Noise Estimation for Speech Enhancement
    Bouchair, Asma
    Selouani, Sid Ahmed
    Amrouche, Abderrahmane
    Sidi Yakoub, Mohammed
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2022, 41 (01) : 196 - 223
  • [25] Speaker recognition in an emotionalized spontaneous speech using empirical mode decomposition
    Chou, Fu-Hua
    Liu, Yu-Shuo
    Chiou, Che-Wun
    IMECS 2007: INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, VOLS I AND II, 2007, : 387 - +
  • [26] Investigation of a power distribution of a speech signal under empirical mode decomposition
    Priorov, A.
    Pavlovichev, P.
    2017 SYSTEMS OF SIGNAL SYNCHRONIZATION, GENERATING AND PROCESSING IN TELECOMMUNICATIONS (SINKHROINFO), 2017,
  • [27] Combining feature space discriminative training with long-term spectro-temporal features for noise-robust speech recognition
    Fukuda, Takashi
    Ichikawa, Osamu
    Nishimura, Masafumi
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 236 - 239
  • [28] Feature Point Detection Utilizing the Empirical Mode Decomposition
    Jesmin Farzana Khan
    Kenneth Barner
    Reza Adhami
    EURASIP Journal on Advances in Signal Processing, 2008
  • [29] A study of the characteristics of white noise using the empirical mode decomposition method
    Wu, ZH
    Huang, NE
    PROCEEDINGS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2004, 460 (2046): : 1597 - 1611
  • [30] Empirical mode decomposition and robust pitch detection based on recurrence analysis
    Wang, Jingfang
    SENSORS, MEASUREMENT AND INTELLIGENT MATERIALS, PTS 1-4, 2013, 303-306 : 1035 - 1038