Noise-robust speech feature processing with empirical mode decomposition

Authors:

Kuo-Hau Wu

Chia-Ping Chen

Bing-Feng Yeh

Affiliation:

[1] Department of Computer Science and Engineering, National Sun Yat-Sen University

Source:

EURASIP Journal on Audio, Speech, and Music Processing, vol. 2011

Keywords:

Speech Signal; Empirical Mode Decomposition; Automatic Speech Recognition; Intrinsic Mode Function; Lower Envelope

Abstract:

In this article, a novel technique based on the empirical mode decomposition methodology for processing speech features is proposed and investigated. The empirical mode decomposition generalizes Fourier analysis: it decomposes a signal into a sum of intrinsic mode functions. In this study, we implement an iterative algorithm to find the intrinsic mode functions of any given signal. We design a novel speech feature post-processing method based on the extracted intrinsic mode functions to achieve noise robustness for automatic speech recognition. Evaluation results on the noisy-digit Aurora 2.0 database show that our method leads to a significant performance improvement. The relative improvement over the baseline features increases from 24.0% to 41.1% when the proposed post-processing method is applied to mean-variance normalized speech features. The proposed method also improves on the performance achieved by a very noise-robust frontend when the test speech data are highly mismatched.
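
The iterative algorithm mentioned in the abstract is commonly called sifting. The following is a minimal sketch under stated assumptions, not the authors' implementation: it uses piecewise-linear envelopes where empirical mode decomposition is typically done with cubic splines, a fixed number of sifting iterations in place of a convergence criterion, and the signal endpoints as extra envelope knots. The function names (`local_extrema`, `envelope`, `sift`, `emd`) and parameters (`n_iter`, `max_imfs`) are hypothetical and chosen only for illustration.

```python
import math


def local_extrema(x):
    """Return index lists of interior local maxima and minima of x."""
    maxima, minima = [], []
    for i in range(1, len(x) - 1):
        if x[i - 1] < x[i] >= x[i + 1]:
            maxima.append(i)
        elif x[i - 1] > x[i] <= x[i + 1]:
            minima.append(i)
    return maxima, minima


def envelope(x, knots):
    """Piecewise-linear envelope through x at the knot indices.

    Assumption: both endpoints are added as knots so the envelope
    covers the whole signal (a crude boundary treatment).
    """
    pts = sorted(set([0] + knots + [len(x) - 1]))
    env = [0.0] * len(x)
    for a, b in zip(pts, pts[1:]):
        for i in range(a, b + 1):
            t = (i - a) / (b - a)
            env[i] = (1 - t) * x[a] + t * x[b]
    return env


def sift(x, n_iter=10):
    """Extract one approximate intrinsic mode function (IMF) from x.

    Each pass subtracts the mean of the upper and lower envelopes.
    Assumption: a fixed iteration count replaces a stopping criterion.
    """
    h = list(x)
    for _ in range(n_iter):
        maxima, minima = local_extrema(h)
        if len(maxima) < 2 or len(minima) < 2:
            break
        upper = envelope(h, maxima)
        lower = envelope(h, minima)
        h = [v - 0.5 * (u + l) for v, u, l in zip(h, upper, lower)]
    return h


def emd(x, max_imfs=5):
    """Decompose x into approximate IMFs plus a final residual."""
    residual = list(x)
    imfs = []
    for _ in range(max_imfs):
        maxima, minima = local_extrema(residual)
        if len(maxima) < 2 or len(minima) < 2:
            break  # too few oscillations left to extract another IMF
        imf = sift(residual)
        imfs.append(imf)
        residual = [r - c for r, c in zip(residual, imf)]
    return imfs, residual


# Example: a slow sine plus a faster, weaker sine.
signal = [
    math.sin(2 * math.pi * 5 * t / 200) + 0.5 * math.sin(2 * math.pi * 40 * t / 200)
    for t in range(200)
]
imfs, residual = emd(signal)
print(len(imfs), "approximate IMFs extracted")
```

By construction, the extracted components and the final residual sum back to the input signal, up to floating-point rounding. How the intrinsic mode functions are then used for feature post-processing is not specified in the abstract, so that step is not sketched here.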
