Upper and lower bounds on the mean of noisy speech: Application to minimax classification

被引:3
作者
Afify, M [1 ]
Siohan, O [1 ]
Lee, CH [1 ]
机构
[1] Bell Labs, Lucent Technol, Multimedia Commun Res Lab, Murray Hill, NJ 07974 USA
来源
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 2002年 / 10卷 / 02期
关键词
minimax classification; robust decision rules; speech recognition;
D O I
10.1109/89.985545
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we derive upper and lower bounds on the mean of speech corrupted by additive noise. The bounds are derived in the log spectral domain. Also approximate bounds on the first and second order time derivatives are developed. It is also shown how to transform these bounds to the Mel frequency cepstral coefficient (MFCC) domain. The proposed bounds are used to define the mismatch neighborhood for minimax classification. It is shown that this parametric neighborhood works quite well for artificially added noise and for a real-life mismatch scenario (moving car environment) which does not fully conform with the theoretical conditions used to derive the bounds. In contrast to traditional neighborhood structure for minimax classification, no empirical tuning of the bounds is required. It is believed that the applicability of the derived bounds is not limited to a minimax setting and can be potentially used to develop various compensation scenarios in the log spectral domain.
引用
收藏
页码:79 / 88
页数:10
相关论文
共 14 条
  • [1] AFIFY M, 2001, P IEEE INT C AC SPEE
  • [2] COMPARISON OF PARAMETRIC REPRESENTATIONS FOR MONOSYLLABIC WORD RECOGNITION IN CONTINUOUSLY SPOKEN SENTENCES
    DAVIS, SB
    MERMELSTEIN, P
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1980, 28 (04): : 357 - 366
  • [3] Durrett R., 1991, PROBABILITY THEORY E
  • [4] CEPSTRAL PARAMETER COMPENSATION FOR HMM RECOGNITION IN NOISE
    GALES, MJF
    YOUNG, SJ
    [J]. SPEECH COMMUNICATION, 1993, 12 (03) : 231 - 239
  • [5] SPEECH RECOGNITION IN NOISY ENVIRONMENTS - A SURVEY
    GONG, YF
    [J]. SPEECH COMMUNICATION, 1995, 16 (03) : 261 - 291
  • [6] Robust speech recognition based on adaptive classification and decision strategies
    Huo, Q
    Lee, CH
    [J]. SPEECH COMMUNICATION, 2001, 34 (1-2) : 175 - 194
  • [7] Robust speech recognition based on a Bayesian prediction approach
    Jiang, H
    Hirose, K
    Huo, Q
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1999, 7 (04): : 426 - 440
  • [8] A minimax search algorithm for robust continuous speech recognition
    Jiang, H
    Hirose, K
    Huo, Q
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2000, 8 (06): : 688 - 694
  • [9] Kim NS, 1998, IEEE SIGNAL PROC LET, V5, P8, DOI 10.1109/97.654866
  • [10] On adaptive decision rules and decision parameter adaptation for automatic speech recognition
    Lee, CH
    Huo, Q
    [J]. PROCEEDINGS OF THE IEEE, 2000, 88 (08) : 1241 - 1269