A Combination of Semisoft and μ-law Thresholding Functions for Enhancing Noisy Speech in Wavelet Packet Domain

Times cited: 0
Authors
Sanam, Tahsina Farah [1]
Shahnaz, Celia [2]
Affiliations
[1] Bangladesh Univ Engn & Technol, Inst Appropriate Technol, Dhaka, Bangladesh
[2] Bangladesh Univ Engn & Technol, Dept Elect & Elect Engn, Dhaka, Bangladesh
Source
2012 7TH INTERNATIONAL CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (ICECE) | 2012
Keywords
Wavelet Packet Transform; Speech Enhancement; Statistical Modeling; Teager Energy Operator; Semisoft Threshold; μ-law Threshold; ENHANCEMENT
DOI
Not available
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Estimating an exact threshold value and designing an appropriate thresholding function are important criteria for achieving the desired performance of a noisy speech enhancement method in the wavelet packet (WP) domain. This paper presents a new speech enhancement method in which the threshold in a speech or silent subband is statistically determined using the Teager energy operated WP coefficients of the noisy speech. The threshold thus obtained is applied to the WP coefficients of the noisy speech through a combination of semisoft and μ-law thresholding functions, which are chosen according to the approximate and detail WP coefficients in a speech or silent subband. Such a thresholding function is designed to minimize signal distortion and to prevent time-frequency discontinuity. Standard objective measures and subjective observations show that the proposed method outperforms existing state-of-the-art thresholding-based approaches to noisy speech enhancement from high to low SNR levels, yielding enhanced speech with better quality and intelligibility.
Pages: 4
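
The abstract names three building blocks without giving their formulas: the Teager energy operator, a semisoft threshold, and a μ-law threshold. The sketch below shows the textbook forms of each, as an illustration only. It is an assumption that the paper uses these exact definitions; the function names, the two-threshold parameters `t1` and `t2`, and the default `mu=255.0` are hypothetical choices made for this example, and the paper's statistical threshold estimation is not reproduced.

```python
import math


def teager_energy(x):
    """Discrete Teager energy: psi[n] = x[n]^2 - x[n-1] * x[n+1].

    Endpoints have no two-sided neighbours, so they fall back to x[n]^2
    (an assumption made for this sketch).
    """
    out = [v * v for v in x]
    for i in range(1, len(x) - 1):
        out[i] = x[i] * x[i] - x[i - 1] * x[i + 1]
    return out


def semisoft(c, t1, t2):
    """Semisoft (firm) threshold with lower threshold t1 < upper threshold t2.

    Coefficients up to t1 in magnitude are zeroed, coefficients above t2 are
    kept, and those in between are shrunk linearly so the output is continuous.
    """
    a = abs(c)
    if a <= t1:
        return 0.0
    if a > t2:
        return c
    return math.copysign(t2 * (a - t1) / (t2 - t1), c)


def mu_law(c, t, mu=255.0):
    """mu-law threshold: coefficients below t in magnitude are attenuated
    smoothly instead of being set to zero; larger coefficients are kept.
    """
    a = abs(c)
    if a >= t:
        return c
    return math.copysign(t * ((1.0 + mu) ** (a / t) - 1.0) / mu, c)
```

Both thresholding functions are continuous at their thresholds, which is consistent with the abstract's stated goal of limiting signal distortion and avoiding time-frequency discontinuity. Which function is applied to which class of WP coefficients is not specified in the abstract and is left open here.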