ROBUST SPEECH RECOGNITION USING MULTIVARIATE COPULA MODELS

Authors
Bayestehtashk, Alireza [1]
Shafran, Izhak [2]
Babaeian, Amir [3]
Affiliations
[1] Oregon Hlth & Sci Univ, Portland, OR 97201 USA
[2] Google Inc, Mountain View, CA USA
[3] Univ Calif San Diego, La Jolla, CA 92093 USA
Source
2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS | 2016
Keywords
Copula model; Robust speech recognition; Deep neural network; Aurora 4;
DOI
Not available
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
In this paper, we continue our investigation into copula models for real-valued multivariate features, with the goal of compensating for the mismatch between training and testing conditions. Previously, we reported results on UCI classification tasks in which our method consistently outperformed competing classifiers [1]. Here, we extend this work from classification to recognition and elaborate further on the mathematical properties of our models in the form of lemmas. We report results on the Aurora 4 automatic speech recognition (ASR) task, which contains utterances with a wide range of background noise that is not well represented in the training data. Our results show that the proposed copula-based models improve accuracy by about 7% relative (11.6 vs. 12.4) over a comparable baseline.
Pages: 5890-5894
Page count: 5
Related papers
50 in total
  • [41] ROBUST SPEECH RECOGNITION USING DYNAMIC NOISE ADAPTATION
    Rennie, Steven
    Dognin, Pierre
    Fousek, Petr
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011: 4592-4595
  • [42] Robust speech recognition using time boundary detection
    Mohajer, K
    Hu, ZM
    MULTISENSOR, MULTISOURCE INFORMATION FUSION: ARCHITECTURES, ALGORITHMS, AND APPLICATIONS 2003, 2003, 5099: 335-343
  • [43] Speech recognition using linear dynamic models
    Frankel, Joe
    King, Simon
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (01): 246-256
  • [44] Using SVMs and discriminative models for speech recognition
    Smith, ND
    Gales, MJF
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002: 77-80
  • [45] AUTOMATIC SPEECH RECOGNITION USING PSYCHOACOUSTIC MODELS
    ZWICKER, E
    TERHARDT, E
    PAULUS, E
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1979, 65 (02): 487-498
  • [46] Speech recognition using probabilistic and statistical models
    Singh, Amber
    Anand, R. S.
    2015 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS (CICN), 2015: 686-690
  • [47] Surrogacy assessment using principal stratification with multivariate normal and Gaussian copula models
    Taylor, Jeremy M. G.
    Conlon, Anna S. C.
    Elliott, Michael R.
    CLINICAL TRIALS, 2015, 12 (04): 317-322
  • [48] STRANDED GAUSSIAN MIXTURE HIDDEN MARKOV MODELS FOR ROBUST SPEECH RECOGNITION
    Zhao, Yong
    Juang, Biing-Hwang
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012: 4301-4304
  • [49] Combining Multiple Acoustic Models in GMM Spaces for Robust Speech Recognition
    Kang, Byung Ok
    Kwon, Oh-Wook
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2016, E99D (03): 724-730
  • [50] Robust combination of neural networks and hidden Markov models for speech recognition
    Trentin, E
    Gori, M
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2003, 14 (06): 1519-1531