Wavelet-Based Speech Enhancement Using Time-Frequency Adaptation

Cited by: 2
Author
Wang, Kun-Ching [1]
Affiliation
[1] Shin Chien Univ, Dept Informat Technol & Commun, Kaohsiung 845, Taiwan
Keywords
NOISE; RECOGNITION
DOI
10.1155/2009/924135
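Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809
Abstract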
Wavelet denoising is commonly used for speech enhancement because it is simple to implement. However, conventional methods introduce musical residual noise when thresholding the background noise, and they often eliminate the unvoiced components of speech. This paper proposes a novel wavelet coefficient threshold (WCT) algorithm based on time-frequency adaptation. An unvoiced speech enhancement algorithm is also integrated into the system to improve speech intelligibility. The WCT of each subband is first temporally adjusted according to the value of the a posteriori signal-to-noise ratio (SNR). To prevent the degradation of unvoiced sounds during denoising, the algorithm uses a simple speech/noise detector (SND) and further divides the speech signal into voiced and unvoiced sounds. Appropriate wavelet thresholding is then applied according to the voiced/unvoiced (V/U) decision. Based on the masking properties of the human auditory system, a perceptual gain factor is incorporated into the wavelet thresholding to suppress musical residual noise. Simulation results show that the proposed method reduces noise with little speech degradation and that its overall performance is superior to several competitive methods. Copyright (C) 2009 Kun-Ching Wang.
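The general idea of adjusting a subband threshold by the a posteriori SNR can be illustrated with a minimal sketch. This is not the paper's WCT rule, which the abstract does not specify. The one-level Haar transform, the universal threshold `sigma * sqrt(2 ln N)` used as a baseline, the linear SNR-to-scale mapping, and every function name and parameter (`snr_lo`, `snr_hi`, the 0.8 factor) are assumptions chosen for illustration. The voiced/unvoiced decision and the perceptual gain factor are not modeled.

```python
import math


def haar_dwt(x):
    """One-level Haar DWT of an even-length sequence: (approximation, detail)."""
    s = math.sqrt(2.0)
    approx = [(x[i] + x[i + 1]) / s for i in range(0, len(x) - 1, 2)]
    detail = [(x[i] - x[i + 1]) / s for i in range(0, len(x) - 1, 2)]
    return approx, detail


def haar_idwt(approx, detail):
    """Inverse of haar_dwt."""
    s = math.sqrt(2.0)
    out = []
    for a, d in zip(approx, detail):
        out.append((a + d) / s)
        out.append((a - d) / s)
    return out


def soft_threshold(coeffs, thr):
    """Shrink each coefficient toward zero by thr (soft thresholding)."""
    return [math.copysign(max(abs(c) - thr, 0.0), c) for c in coeffs]


def posterior_snr_db(coeffs, noise_power):
    """A posteriori SNR in dB: observed subband power over estimated noise power."""
    power = sum(c * c for c in coeffs) / len(coeffs)
    return 10.0 * math.log10(max(power, 1e-12) / max(noise_power, 1e-12))


def adaptive_threshold(coeffs, noise_sigma, snr_lo=0.0, snr_hi=20.0):
    """Baseline universal threshold, lowered as the a posteriori SNR rises.

    The linear mapping and its constants are hypothetical, not from the paper.
    """
    base = noise_sigma * math.sqrt(2.0 * math.log(len(coeffs)))
    snr = posterior_snr_db(coeffs, noise_sigma ** 2)
    t = min(max((snr - snr_lo) / (snr_hi - snr_lo), 0.0), 1.0)
    return base * (1.0 - 0.8 * t)


def denoise_frame(frame, noise_sigma):
    """Threshold the detail subband of one frame and reconstruct it."""
    approx, detail = haar_dwt(frame)
    thr = adaptive_threshold(detail, noise_sigma)
    return haar_idwt(approx, soft_threshold(detail, thr))
```

In this sketch, a subband whose power is well above the noise estimate is treated as speech-dominated and thresholded lightly, while a subband near the noise floor receives the full baseline threshold.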
Pages: 8