Efficient SNR Driven SPLICE Implementation for Robust Speech Recognition

被引:0
|
作者
Squartini, Stefano [1 ]
Principi, Emanuele [1 ]
Cifani, Simone [1 ]
Rotili, Rudi [1 ]
Piazza, Francesco [1 ]
机构
[1] Univ Politecn Marche, DIBET, MediaLabs3, Ancona, Italy
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The SPLICE algorithm has been recently proposed in the literature to address the robustness issue in Automatic Speech Recognition (ASR). Several variants have been also proposed to improve some drawbacks of the original technique. In this presentation an innovative efficient solution is discussed: it is based on SNR estimation in the frequency or mel domain and investigates the possibility of using different noise types for GMM training in order to maximize the generalization capabilities of the tool and therefore the recognition performances in presence of unknown noise sources. Computer simulations, conducted on the AURORA2 database, seem to confirm the effectiveness of the idea: the proposed approach yields similar accuracy performances w.r.t. the reference one, even employing a simpler mismatch compensation paradigm which does not need any a-priori knowledge on the noises used in the training phase.
引用
收藏
页码:70 / 80
页数:11
相关论文
共 50 条
  • [1] SNR-normalization for robust speech recognition
    Claes, T
    VanCompernolle, D
    1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 331 - 334
  • [2] Uncertainty decoding with splice for noise robust speech recognition
    Droppo, J
    Acero, A
    Deng, L
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 57 - 60
  • [3] Feature enhancement by speaker-normalized splice for robust speech recognition
    Shinohara, Yusuke
    Masuko, Takashi
    Akamine, Masami
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4881 - 4884
  • [4] An efficient algorithm for automatic robust speech recognition
    Kotnik, Bojan
    Kačič, Zdravko
    Horvat, Bogomir
    Elektrotehniski Vestnik/Electrotechnical Review, 2002, 69 (01): : 69 - 74
  • [5] Noise robust speech recognition for voice driven wheelchair
    Sasou, Akira
    Kojima, Hiroaki
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 585 - 588
  • [6] Efficient Speaker and Noise Normalization for Robust Speech Recognition
    Joshi, Vikas
    Bilgi, Raghavendra
    Umesh, S.
    Benitez, C.
    Garcia, L.
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2612 - 2615
  • [7] An efficient framework for robust mobile speech recognition services
    Rose, RC
    Arizmendi, I
    Parthasarathy, S
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 316 - 319
  • [8] SNR Features for Automatic Speech Recognition
    Garner, Philip N.
    2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 182 - 187
  • [9] Noise suppression based on neurophysiologically-motivated SNR estimation for robust speech recognition
    Tchorz, J
    Kleinschmidt, M
    Kollmeier, B
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 13, 2001, 13 : 821 - 827
  • [10] Towards efficient and scalable speech compression schemes for robust speech recognition applications
    Srinivasamurthy, N
    Ortega, A
    Zhu, Q
    Alwan, A
    2000 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, PROCEEDINGS VOLS I-III, 2000, : 249 - 252