Using a DBN to integrate Sparse Classification and GMM-based ASR

被引:0
|
作者
Sun, Yang [1 ]
Gemmeke, Jort F. [1 ]
Cranen, Bert [1 ]
ten Bosch, Louis [1 ]
Boves, Lou [1 ]
机构
[1] Radboud Univ Nijmegen, Ctr Language & Speech Technol, Nijmegen, Netherlands
来源
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4 | 2010年
关键词
noise robustness; speech recognition; dynamic bayesian network; sparse classification; SPEECH RECOGNITION;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The performance of an HMM-based speech recognizer using MFCCs as input is known to degrade dramatically in noisy conditions. Recently, an exemplar-based noise robust ASR approach, sparse classification (SC) was introduced. While very successful at lower SNRs, the performance at high SNRs suffered when compared to HMM-based systems. In this work, we propose to use a Dynamic Bayesian Network (DBN) to implement an HMM-model that uses both MFCCs and phone predictions extracted from the SC system as input. By doing experiments on the AURORA-2 connected digit recognition task, we show that our approach successfully combines the strengths of both systems, resulting in competitive recognition accuracies at both high and low SNRs.
引用
收藏
页码:2098 / 2101
页数:4
相关论文
共 50 条
  • [31] A GMM-based User Model for Knowledge Recommendation
    Yang, Nian
    Wang, Guoxin
    Hao, Jia
    Yan, Yan
    Han, Hairong
    2017 3RD IEEE INTERNATIONAL CONFERENCE ON CYBERNETICS (CYBCONF), 2017, : 308 - 312
  • [32] A GMM-Based Speaker Identification System on FPGA
    Kan, Phak Len Eh
    Allen, Tim
    Quigley, Steven F.
    RECONFIGURABLE COMPUTING: ARCHITECTURES, TOOLS AND APPLICATIONS, 2010, 5992 : 358 - 363
  • [33] Quantization for adapted GMM-based speaker verification
    Tseng, Ivy H.
    Verscheure, Olivier
    Turaga, Deepak S.
    Chaudhari, Upendra V.
    2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 653 - 656
  • [34] Multiple Object Tracking Using Improved GMM-Based Motion Segmentation
    Fazli, Saeid
    Pour, Hamed Moradi
    Bouzari, Hamed
    ECTI-CON: 2009 6TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING/ELECTRONICS, COMPUTER, TELECOMMUNICATIONS AND INFORMATION TECHNOLOGY, VOLS 1 AND 2, 2009, : 1096 - 1099
  • [35] Comparing GMM-based speech transformation systems
    Mesbahi, Larbi
    Barreaud, Vincent
    Boeffard, Olivier
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2852 - 2855
  • [36] Improved GMM-based language recognition using constrained MLLR transforms
    Shen, Wade
    Reynolds, Douglas
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4149 - 4152
  • [37] Testing Interval Forecasts: A GMM-Based Approach
    Dumitrescu, Elena-Ivona
    Hurlin, Christophe
    Madkour, Jaouad
    JOURNAL OF FORECASTING, 2013, 32 (02) : 97 - 110
  • [38] GMM-Based Evaluation of Synthetic Speech Quality Using 2D Classification in Pleasure-Arousal Scale
    Pribil, Jiri
    Pribilova, Anna
    Matousek, Jindrich
    APPLIED SCIENCES-BASEL, 2021, 11 (01): : 1 - 18
  • [39] Estimating the position of mistracked coil of EMA Data using GMM-based methods
    Fang, Qiang
    Wei, Jianguo
    Hu, Fang
    Li, Aijun
    Wang, Haibo
    2013 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2013,
  • [40] GMM-Based Single-Joint Angle Estimation Using EMG Signals
    Michieletto, Stefano
    Tonin, Luca
    Antonello, Mauro
    Bortoletto, Roberto
    Spolaor, Fabiola
    Pagello, Enrico
    Menegatti, Emanuele
    INTELLIGENT AUTONOMOUS SYSTEMS 13, 2016, 302 : 1173 - 1184