A perceptually motivated approach for speech enhancement

被引:60
作者
Hu, Y [1 ]
Loizou, PC [1 ]
机构
[1] Univ Texas, Dept Elect Engn, Richardson, TX 75083 USA
来源
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 2003年 / 11卷 / 05期
基金
美国国家卫生研究院;
关键词
multitaper power spectrum estimation; multiwindow covariance matrix estimation; perceptual weighting; spectral subtraction method; speech enhancement; subspace method;
D O I
10.1109/TSA.2003.815936
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A new perceptually motivated approach is proposed in this paper for enhancement of speech corrupted by colored noise. The proposed approach takes into account the frequency masking properties of the human auditory system and reduces the perceptual effect of the residual noise. This new perceptual method is incorporated into a frequency-domain speech enhancement method and a subspace-based speech enhancement method. A better power spectrum/autocorrelation function estimator was also developed to improve the performance of the proposed algorithms. Objective measures and informal listening tests demonstrated significant improvements over other methods when tested with TIMIT sentences corrupted by various types of colored noise.
引用
收藏
页码:457 / 465
页数:9
相关论文
共 32 条
  • [1] [Anonymous], ADV SPEECH SIGNAL PR
  • [2] [Anonymous], P IEEE INT C AC SPEE
  • [3] PREDICTIVE CODING OF SPEECH SIGNALS AND SUBJECTIVE ERROR CRITERIA
    ATAL, BS
    SCHROEDER, MR
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1979, 27 (03): : 247 - 254
  • [4] AZIRANI AA, 1995, INT CONF ACOUST SPEE, P800, DOI 10.1109/ICASSP.1995.479815
  • [5] ALGORITHM - SOLUTION OF MATRIX EQUATION AX+XB = C
    BARTELS, RH
    STEWART, GW
    [J]. COMMUNICATIONS OF THE ACM, 1972, 15 (09) : 820 - &
  • [6] BRANDENBURG K, 1994, J AUDIO ENG SOC, V42, P780
  • [7] Deller J., 2000, Discrete-Time Processing of Speech Signals
  • [8] A SIGNAL SUBSPACE APPROACH FOR SPEECH ENHANCEMENT
    EPHRAIM, Y
    VANTREES, HL
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1995, 3 (04): : 251 - 266
  • [9] GRAY RM, 1972, IEEE T INFORM THEORY, V18, P725, DOI 10.1109/TIT.1972.1054924
  • [10] Gustafsson S, 1998, INT CONF ACOUST SPEE, P397, DOI 10.1109/ICASSP.1998.674451