ROBUST SPEECH RECOGNITION USING MULTIVARIATE COPULA MODELS

Cited by: 0

Authors:

Bayestehtashk, Alireza [1]

Shafran, Izhak [2]

Babaeian, Amir [3]

Affiliations:

[1] Oregon Hlth & Sci Univ, Portland, OR 97201 USA

[2] Google Inc, Mountain View, CA USA

[3] Univ Calif San Diego, La Jolla, CA 92093 USA

Source:

2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS | 2016

Keywords:

Copula model; Robust speech recognition; Deep neural network; Aurora 4

DOI:

Not available

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification codes:

070206 ; 082403 ;

Abstract:

In this paper, we continue our investigation into copula models for real-valued multivariate features, with the goal of compensating for the mismatch between training and testing conditions. Previously, we reported results on UCI classification tasks where our method consistently outperformed competing classifiers [1]. Here, we extend this work from classification to recognition and elaborate further on the mathematical properties of our models in the form of lemmas. We report results on the Aurora 4 automatic speech recognition (ASR) task, which contains utterances with a wide range of background noise that is not well represented in the training data. Our results show that the proposed copula-based models improve the accuracy by about 7% (11.6 vs. 12.4) over a comparable baseline.

Pages: 5890-5894

Page count: 5

50 records in total

[41] ROBUST SPEECH RECOGNITION USING DYNAMIC NOISE ADAPTATION
Rennie, Steven
Dognin, Pierre
Fousek, Petr
2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011: 4592-4595
[42] Robust speech recognition using time boundary detection
Mohajer, K
Hu, ZM
MULTISENSOR, MULTISOURCE INFORMATION FUSION: ARCHITECTURES, ALGORITHMS, AND APPLICATIONS 2003, 2003, 5099: 335-343
[43] Speech recognition using linear dynamic models
Frankel, Joe
King, Simon
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (01): 246-256
[44] Using SVMs and discriminative models for speech recognition
Smith, ND
Gales, MJF
2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002: 77-80
[45] AUTOMATIC SPEECH RECOGNITION USING PSYCHOACOUSTIC MODELS
ZWICKER, E
TERHARDT, E
PAULUS, E
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1979, 65 (02): 487-498
[46] Speech recognition using probabilistic and statistical models
Singh, Amber
Anand, R. S.
2015 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS (CICN), 2015: 686-690
[47] Surrogacy assessment using principal stratification with multivariate normal and Gaussian copula models
Taylor, Jeremy M. G.
Conlon, Anna S. C.
Elliott, Michael R.
CLINICAL TRIALS, 2015, 12 (04): 317-322
[48] STRANDED GAUSSIAN MIXTURE HIDDEN MARKOV MODELS FOR ROBUST SPEECH RECOGNITION
Zhao, Yong
Juang, Biing-Hwang
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012: 4301-4304
[49] Combining Multiple Acoustic Models in GMM Spaces for Robust Speech Recognition
Kang, Byung Ok
Kwon, Oh-Wook
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2016, E99D (03): 724-730
[50] Robust combination of neural networks and hidden Markov models for speech recognition
Trentin, E
Gori, M
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2003, 14 (06): 1519-1531