Phase AutoCorrelation (PAC) features in entropy based multi-stream for robust speech recognition

被引:0
作者
Ikbal, S [1 ]
Misra, H [1 ]
Bourlard, H [1 ]
Hermansky, H [1 ]
机构
[1] IDIAP, Martigny, Switzerland
来源
2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING | 2004年
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Methods to improve noise robustness of speech recognition systems often result in degradation of recognition performance for clean speech. Recently proposed Phase AutoCorrelation (PAC) [1, 2] based features, showing noticeable improvement in noise robustness, also suffer from this draw back. In this paper, we try to alleviate this problem by using the PAC based features along with regular speech features in a multi-stream framework. The multi-stream system uses entropy of the posterior probability distribution, computed during recognition, as a confidence measure to adaptively combine evidences from different feature streams [3]. Experimental results obtained on OGI Numbers95 database and Noisex92 noise database show that such a system yields best possible recognition performance in all conditions. Actually, the combination always performs better than the best performing stream for all the conditions.
引用
收藏
页码:205 / 208
页数:4
相关论文
共 11 条
  • [1] BOLL SF, 1979, P IEEE ASSP 27 APR, P113
  • [2] BOURLARD H, 1993, KLUWER INT SERIES EN, P247
  • [3] COLE R, 1995, P EUR C SPEECH COMM, V1, P821
  • [4] PERCEPTUAL LINEAR PREDICTIVE (PLP) ANALYSIS OF SPEECH
    HERMANSKY, H
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1990, 87 (04) : 1738 - 1752
  • [5] RASTA Processing of Speech
    Hermansky, Hynek
    Morgan, Nelson
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (04): : 578 - 589
  • [6] IKBAL S, 2003, P ICASSP03 HONG KONG
  • [7] IKBAL S, 2003, P IEEE ASRU 2003 WOR
  • [8] Mansour D., 1988, P ICASSP 88, P36
  • [9] MISRA H, 2003, P ICASSP 03 HONG KON
  • [10] Multi-stream adaptive evidence combination for noise robust ASR
    Morris, A
    Hagen, A
    Glotin, H
    Bourlard, H
    [J]. SPEECH COMMUNICATION, 2001, 34 (1-2) : 25 - 40