Manifold HLDA and its application to robust speech recognition

被引:0
|
作者
Kubo, Toshiaki [1 ]
Ogawa, Tetsuji [1 ]
Kobayashi, Tetsunori [1 ]
机构
[1] Waseda Univ, Dept Comp Sci, Shinjuku Ku, Tokyo 1698555, Japan
来源
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5 | 2006年
关键词
HLDA; MHLDA; robust speech recognition;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A manifold heteroscedastic linear discriminant analysis (MHLDA) which removes environmental information explicitly from the useful information for discrimination is proposed. Usually, a feature parameter used in pattern recognition involves categorical information and also environmental information. A well-known HLDA tries to extract useful information (UT) to represent categorical information from the feature parameter. However, environmental information is still remained in the UI parameters extracted by HLDA, and it causes slight degradation in performance. This is because HLDA does not handle the environmental information explicitly. The proposed MHLDA also tries to extract UI like HLDA, but it handles environmental information explicitly. This handling makes MHLDA-based UI parameter less influenced of environment. However, as compensation, in MHLDA, the categorical information is little bit destroyed. In this paper, we try to combine HLDA-based UI and MHLDA-based UI for pattern recognition, and draw benefit of both parameters. Experimental results show the effectiveness of this combining method.
引用
收藏
页码:1551 / 1554
页数:4
相关论文
共 50 条
  • [1] NOISE AWARE MANIFOLD LEARNING FOR ROBUST SPEECH RECOGNITION
    Tomar, Vikrant Singh
    Rose, Richard C.
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7087 - 7091
  • [2] Investigating Manifold Learning Technique for Robust Speech Recognition
    Yan, Bi-Cheng
    Shih, Chin-Hong
    Chen, Berlin
    Liu, Shih-Hung
    2018 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2018, : 68 - 73
  • [3] A Review of Signal Subspace Speech Enhancement and Its Application to Noise Robust Speech Recognition
    Kris Hermus
    Patrick Wambacq
    Hugo Van hamme
    EURASIP Journal on Advances in Signal Processing, 2007
  • [4] A review of signal subspace speech enhancement and its application to noise robust speech recognition
    Hermus, Kris
    Wambacq, Patrick
    Van hamme, Hugo
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2007, 2007 (1)
  • [5] A model of dynamic auditory perception and its application to robust speech recognition
    Strope, B
    Alwan, A
    1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 37 - 40
  • [6] A Simple and Robust Persian Speech Recognition System and Its Application to Robotics
    Hasanabadi, H.
    Rowhanimanesh, A.
    Yazdi, H. Tabatabaee
    Sharif, N.
    2008 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER THEORY AND ENGINEERING, 2008, : 239 - +
  • [7] State-based labelling for a sparse representation of speech and its application to robust speech recognition
    Virtanen, Tuomas
    Gemmeke, Jort F.
    Hurmalainen, Antti
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 893 - +
  • [8] Dynamic Minimum Subband Spectral Subtraction and its application in robust speech recognition
    Ma, Xin
    Peng, Yuhua
    ICICIC 2006: FIRST INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING, INFORMATION AND CONTROL, VOL 3, PROCEEDINGS, 2006, : 349 - +
  • [9] A Family of Discriminative Manifold Learning Algorithms and Their Application to Speech Recognition
    Tomar, Vikrant Singh
    Rose, Richard C.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (01) : 161 - 171
  • [10] ON THE APPLICATION OF REVERBERATION SUPPRESSION TO ROBUST SPEECH RECOGNITION
    Maas, Roland
    Habets, Emanuel A. P.
    Sehr, Armin
    Kellermann, Walter
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 297 - 300