Wavelet-Based Speech Enhancement Using Time-Frequency Adaptation

被引:2
|
作者
Wang, Kun-Ching [1 ]
机构
[1] Shin Chien Univ, Dept Informat Technol & Commun, Kaohsiung 845, Taiwan
关键词
NOISE; RECOGNITION;
D O I
10.1155/2009/924135
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Wavelet denoising is commonly used for speech enhancement because of the simplicity of its implementation. However, the conventional methods generate the presence of musical residual noise while thresholding the background noise. The unvoiced components of speech are often eliminated from this method. In this paper, a novel algorithm of wavelet coefficient threshold (WCT) based on time-frequency adaptation is proposed. In addition, an unvoiced speech enhancement algorithm is also integrated into the system to improve the intelligibility of speech. The wavelet coefficient threshold (WCT) of each subband is first temporally adjusted according to the value of a posterior signal-to-noise ratio (SNR). To prevent the degradation of unvoiced sounds during noise, the algorithm utilizes a simple speech/noise detector (SND) and further divides speech signal into unvoiced and voiced sounds. Then, we apply appropriate wavelet thresholding according to voiced/unvoiced (V/U) decision. Based on the masking properties of human auditory system, a perceptual gain factor is adopted into wavelet thresholding for suppressing musical residual noise. Simulation results show that the proposed method is capable of reducing noise with little speech degradation and the overall performance is superior to several competitive methods. Copyright (C) 2009 Kun-Ching Wang.
引用
收藏
页数:8
相关论文
共 50 条
  • [41] Wavelet-based imagined speech classification using electroencephalography
    Pawar, Dipti
    Dhage, Sudhir
    INTERNATIONAL JOURNAL OF BIOMEDICAL ENGINEERING AND TECHNOLOGY, 2022, 38 (03) : 215 - 224
  • [42] Joint Time-Frequency Segmentation Algorithm for Transient Speech Decomposition and Speech Enhancement
    Tantibundhit, Charturong
    Pernkopf, Franz
    Kubin, Gernot
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (06): : 1417 - 1428
  • [43] Speech enhancement with natural sounding residual noise based on connected time-frequency speech presence regions
    Sorensen, KV
    Andersen, SV
    EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2005, 2005 (18) : 2954 - 2964
  • [44] An Effective Target Speech Enhancement with Single Acoustic Vector Sensor Based on the Speech Time-Frequency Sparsity
    Zou, Y. X.
    Wang, Y. Q.
    Wang, Peng
    Ritz, C. H.
    Xi, Jiangtao
    2014 19TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2014, : 547 - 551
  • [45] Speech Enhancement with Natural Sounding Residual Noise Based on Connected Time-Frequency Speech Presence Regions
    Karsten Vandborg Sørensen
    Søren Vang Andersen
    EURASIP Journal on Advances in Signal Processing, 2005
  • [46] Wavelet-based document enhancement
    Ahmed, Mohammed N.
    Eid, Ahmed
    NIP 22: 22ND INTERNATIONAL CONFERENCE ON DIGITAL PRINTING TECHNOLOGIES, FINAL PROGRAM AND PROCEEDINGS, 2006, : 256 - +
  • [47] Wavelet-Based Bracketing, Time-Frequency Beta Burst Detection: New Insights in Parkinson's Disease
    Sil, Tanmoy
    Hanafi, Ibrahem
    Eldebakey, Hazem
    Palmisano, Chiara
    Volkmann, Jens
    Muthuraman, Muthuraman
    Reich, Martin M.
    Peach, Robert
    NEUROTHERAPEUTICS, 2023, 20 (06) : 1767 - 1778
  • [48] Improve Speech Enhancement using Perception-High-Related Time-Frequency Loss
    Zhao, Ding
    Zhang, Zhan
    Yu, Bin
    Wang, Yuehai
    INTERSPEECH 2022, 2022, : 5483 - 5487
  • [49] Time-frequency analysis of financial stress and global commodities prices: Insights from wavelet-based approaches
    Armah, Mohammed
    Amewu, Godfred
    Bossman, Ahmed
    COGENT ECONOMICS & FINANCE, 2022, 10 (01):
  • [50] Okun's law revisited in the time-frequency domain: introducing unemployment into a wavelet-based control model
    Crowley, Patrick M.
    Hudgins, David
    EMPIRICAL ECONOMICS, 2021, 61 (05) : 2635 - 2662