SECOND ORDER VECTOR TAYLOR SERIES BASED ROBUST SPEECH RECOGNITION

被引：0

作者：

Bu, Suliang ^{[1
]}

Qian, Yanmin ^{[1
]}

Sim, Khe Chai ^{[2
]}

You, Yongbin ^{[1
]}

Yu, Kai ^{[1
]}

机构：

[1] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, MOE, Microsoft Key Lab Intelligent Comp & Intelligent, Shanghai 200030, Peoples R China

[2] Natl Univ Singapore, Dept Comp Sci, Singapore 117548, Singapore

来源：

2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2014年

关键词：

robust speech recognition; model based compensation; Vector Taylor Series; APPROXIMATION;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Vector Taylor Series (VTS) model based compensation approach has been successfully applied to various robust speech recognition tasks. In this paper, a novel method to derive the formula to calculate the static and dynamic statistics based on second-order VTS (sVTS) is presented, which provides a new insight on the VTS approximation. Lengthy derivation could therefore be avoided when high order VTS is used and the proposed approach is more compact and easier to implement compared to previous high order VTS approaches. Experiments on Aurora 4 showed that the proposed sVTS based model compensation approach obtained 16.7% relative WER reduction over traditional first-order VTS (fVTS) approach.

引用

页数：5

共 50 条

[21] A Feature Compensation Approach Using High-Order Vector Taylor Series Approximation of an Explicit Distortion Model for Noisy Speech Recognition
Du, Jun
Huo, Qiang
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1257 - 1260
[22] Cepstral vector normalization based on stereo data for robust speech recognition
Buera, Luis
Lleida, Eduardo
Miguel, Antonio
Ortega, Alfonso
Saz, Oscar
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (03): : 1098 - 1113
[23] Using vector Taylor series with noise clustering for speech recognition in non-stationary noisy environments
Department of Electronic Engineering, Tsinghua University, Beijing 100084, China
High Technol Letters, 2006, 1 (18-23):
[24] Trellis encoded vector quantization for robust speech recognition
Chou, W
Seshadri, N
Rahim, M
ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 2001 - 2004
[25] Detecting and Labeling Speakers on Overlapping Speech using Vector Taylor Series
Dighe, Pranay
Ferras, Marc
Bourlard, Herve
15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 592 - 596
[26] Taylor Series Method for Second-Order Polynomial ODEs
Latypov, Viktor
Sokolov, Sergei
2015 INTERNATIONAL CONFERENCE "STABILITY AND CONTROL PROCESSES" IN MEMORY OF V.I. ZUBOV (SCP), 2015, : 62 - 64
[27] Emotion recognition of speech signal using Taylor series and deep belief network based classification
Valiyavalappil Haridas, Arul
Marimuthu, Ramalatha
Sivakumar, V. G.
Chakraborty, Basabi
EVOLUTIONARY INTELLIGENCE, 2022, 15 (02) : 1145 - 1158
[28] Channel Compensation in the Generalised Vector Taylor Series Approach to Robust ASR
Loweimi, Erfan
Barker, Jon
Hain, Thomas
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2466 - 2470
[29] Robust speech recognition based on independent vector analysis using harmonic frequency dependency
Jun, Soram
Kim, Minook
Oh, Myungwoo
Park, Hyung-Min
NEURAL COMPUTING & APPLICATIONS, 2013, 22 (7-8): : 1321 - 1327
[30] Robust speech recognition based on independent vector analysis using harmonic frequency dependency
Soram Jun
Minook Kim
Myungwoo Oh
Hyung-Min Park
Neural Computing and Applications, 2013, 22 : 1321 - 1327

← 1 2 3 4 5 →