Clean Speech Feature Estimation based on Soft Spectral Masking

Cited by: 0
Authors
Kim, Young Joon [1]
Lim, Woohyung [2, 3]
Kim, Nam Soo [2, 3]
Affiliations
[1] Elect & Telecommun Res Inst, Daejeon, South Korea
[2] Seoul Natl Univ, Sch Elect Engn, Seoul, South Korea
[3] Seoul Natl Univ, INMC, Seoul, South Korea
Source
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5 | 2006
Keywords
speech recognition; feature compensation; noise masking probability
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we first analyze the contamination of speech by noise from a noise-masking point of view, and then propose a new approach that estimates the degree of the noise masking effect on a clean speech distribution model, based on sequential noise estimation. Sequential noise estimation is performed frame by frame using the interacting multiple model (IMM) algorithm, so that real-time implementation is possible. After the IMM algorithm is applied, the degree of the noise masking effect, termed the noise masking probability (NMP), is calculated. The clean speech spectrum in noisy environments is then estimated by balancing the advantages of the log-spectral-domain algorithm against those of the linear-spectral-domain algorithm according to the NMP. We performed recognition experiments under noisy conditions using the AURORA2 database, which was developed as a standard reference for speech recognition performance. Simulation results show that this approach is effective when the noise masking effect dominates at low SNR.
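The soft-masking idea in the abstract can be illustrated with a minimal Python sketch. It is not the paper's formulation: the record gives no equations, so every modelling choice here is an assumption. The sketch assumes a single Gaussian prior on the clean log-energy of one spectral bin, defines the NMP as the prior probability that the clean log-energy lies below the noise log-energy, uses floored power spectral subtraction as a stand-in for the linear-domain estimate, and falls back to the prior mean when the bin is masked. The noise power is taken as given, so IMM noise tracking is not shown. The function names `noise_masking_probability` and `soft_mask_estimate` are hypothetical.

```python
import math


def noise_masking_probability(log_speech_mean, log_speech_std, log_noise):
    """Assumed NMP: P(clean log-energy < noise log-energy) under a
    Gaussian prior on the clean log-energy (illustrative definition)."""
    z = (log_noise - log_speech_mean) / log_speech_std
    # Standard normal CDF evaluated at z.
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def soft_mask_estimate(noisy_power, noise_power,
                       log_speech_mean, log_speech_std, floor=1e-3):
    """Blend a linear-domain and a log-domain estimate of the clean
    log-energy for one spectral bin, weighted by the assumed NMP."""
    nmp = noise_masking_probability(
        log_speech_mean, log_speech_std, math.log(noise_power))
    # Linear-domain estimate: power spectral subtraction with a floor
    # (a stand-in technique, not taken from the paper).
    linear_est = max(noisy_power - noise_power, floor * noisy_power)
    # Soft decision: trust the subtraction when speech is unmasked,
    # fall back to the prior mean when the bin is likely masked.
    log_est = (1.0 - nmp) * math.log(linear_est) + nmp * log_speech_mean
    return nmp, log_est
```

With a very small noise power the NMP approaches 0 and the estimate follows the spectral-subtraction value; with noise close to the observed power the NMP approaches 1 and the estimate falls back to the prior mean, which mirrors the low-SNR regime the abstract highlights.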
Pages: 2550 / +
Page count: 2