Using a DBN to integrate Sparse Classification and GMM-based ASR

被引:0
|
作者
Sun, Yang [1 ]
Gemmeke, Jort F. [1 ]
Cranen, Bert [1 ]
ten Bosch, Louis [1 ]
Boves, Lou [1 ]
机构
[1] Radboud Univ Nijmegen, Ctr Language & Speech Technol, Nijmegen, Netherlands
来源
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4 | 2010年
关键词
noise robustness; speech recognition; dynamic bayesian network; sparse classification; SPEECH RECOGNITION;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The performance of an HMM-based speech recognizer using MFCCs as input is known to degrade dramatically in noisy conditions. Recently, an exemplar-based noise robust ASR approach, sparse classification (SC) was introduced. While very successful at lower SNRs, the performance at high SNRs suffered when compared to HMM-based systems. In this work, we propose to use a Dynamic Bayesian Network (DBN) to implement an HMM-model that uses both MFCCs and phone predictions extracted from the SC system as input. By doing experiments on the AURORA-2 connected digit recognition task, we show that our approach successfully combines the strengths of both systems, resulting in competitive recognition accuracies at both high and low SNRs.
引用
收藏
页码:2098 / 2101
页数:4
相关论文
共 50 条
  • [1] Stream Selection and Integration in Multistream ASR Using GMM-Based Performance Monitoring
    Ogawa, Tetsuji
    Li, Feipeng
    Hermansky, Hynek
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3331 - 3335
  • [2] GMM-based classification of genomic sequences
    Akhtar, Mahmood
    Ambikairajah, Eliathamby
    Epps, Julien
    PROCEEDINGS OF THE 2007 15TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING, 2007, : 103 - +
  • [3] A GMM-Based Algorithm for Classification of Radar Emitters
    Gong, Xuhua
    Meng, Huadong
    Wang, Xiqin
    ICSP: 2008 9TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-5, PROCEEDINGS, 2008, : 2431 - 2434
  • [4] EARLY FUSION OF SPARSE CLASSIFICATION AND GMM FOR NOISE ROBUST ASR
    Sun, Yang
    Gemmeke, Jort F.
    Cranen, Bert
    ten Bosch, Louis
    Boves, Lou
    19TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2011), 2011, : 1495 - 1499
  • [5] Voice Conversion Using Bilinear Model Integrated with Joint GMM-based Classification
    Sun, Xinjian
    Zhang, Xiongwei
    Yang, Jibin
    Cao, Tieyong
    2013 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST), 2013, : 1225 - 1228
  • [6] Classification of functional brain images using a GMM-based multi-variate approach
    Segovia, F.
    Gorriz, J. M.
    Ramirez, J.
    Salas-Gonzalez, D.
    Alvarez, I.
    Lopez, M.
    Chaves, R.
    Padilla, P.
    NEUROSCIENCE LETTERS, 2010, 474 (01) : 58 - 62
  • [7] GMM-based target classification for ground surveillance Doppler radar
    Bilik, I
    Tabrikian, J
    Cohen, A
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2006, 42 (01) : 267 - 278
  • [8] GMM-based speaker age and gender classification in Czech and Slovak
    Pribil, Jiri
    Pribilova, Anna
    Matousek, Jindrich
    JOURNAL OF ELECTRICAL ENGINEERING-ELEKTROTECHNICKY CASOPIS, 2017, 68 (01): : 3 - 12
  • [9] A GMM-based telephone channel classification for Mandarin speech recognition
    Xu, W
    Peng, X
    Wang, BX
    2004 7TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS 1-3, 2004, : 642 - 645
  • [10] GMM-BASED SIGNIFICANCE DECODING
    Abdelaziz, Ahmed Hussen
    Zeiler, Steffen
    Kolossa, Dorothea
    Leutnant, Volker
    Haeb-Umbach, Reinhold
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 6827 - 6831