NON-LINEAR NOISE COMPENSATION FOR ROBUST SPEECH RECOGNITION USING GAUSS-NEWTON METHOD

被引：0

作者：

Zhao, Yong ^{[1
]}

Juang, Biing-Hwang ^{[1
]}

机构：

[1] Georgia Inst Technol, Ctr Signal & Image Proc, Atlanta, GA 30332 USA

来源：

2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2011年

关键词：

Gauss-Newton method; non-linear compensation; robust speech recognition; vector Taylor series;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this paper, we present the Gauss-Newton method as a unified approach to optimizing non-linear noise compensation models, such as vector Taylor series (VTS), data-driven parallel model combination (DPMC), and unscented transform (UT). We demonstrate that the commonly used approaches that iteratively approximate the noise parameters in an EM framework are variants of the Gauss-Newton method. Through the formulation of the Gauss-Newton method for estimating noise means and variances, the noise estimation problems are reduced to determining the Jacobians of the noisy speech distributions. For the sampling-based compensations, we present two methods, sample Jacobian average (SJA) and cross-covariance (XCOV), to evaluate the Jacobians. Experiments on the Aurora 2 database verify the efficacy of the Gauss-Newton method to these noise compensation models.

引用

页码：4796 / 4799

页数：4

共 50 条

[1] Nonlinear Compensation Using the Gauss-Newton Method for Noise-Robust Speech Recognition
Zhao, Yong
Juang, Biing-Hwang
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (08): : 2191 - 2206
[2] A model reduction for highly non-linear problems using wavelets and the Gauss-Newton method
Argaez, Miguel
Florez, Horacio
Mendez, Osvaldo
2016 ANNUAL CONFERENCE OF THE NORTH AMERICAN FUZZY INFORMATION PROCESSING SOCIETY (NAFIPS), 2016,
[3] A MODIFIED GAUSS-NEWTON METHOD FOR THE SOLUTION OF NON-LINEAR SQUARES PROBLEMS
HARTLEY, HO
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1960, 55 (290) : 361 - 361
[4] MODIFIED GAUSS-NEWTON METHOD FOR FITTING OF NON-LINEAR REGRESSION FUNCTIONS BY LEAST SQUARES
HARTLEY, HO
TECHNOMETRICS, 1961, 3 (02) : 269 - &
[5] Phaseless Recovery Using the Gauss-Newton Method
Gao, Bing
Xu, Zhiqiang
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2017, 65 (22) : 5885 - 5896
[6] Non-linear techniques for robust speech recognition
Ge, Yubo
Niu, Jing
Ge, Lingnan
Shirai, Katsuhiko
CITSA 2007/CCCT 2007: INTERNATIONAL CONFERENCE ON CYBERNETICS AND INFORMATION TECHNOLOGIES, SYSTEMS AND APPLICATIONS : INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATIONS AND CONTROL TECHNOLOGIES, VOL III, POST-CONFERENCE ISSUE, PROCEEDINGS, 2007, : 134 - +
[7] A Noise Robust Speech Recognition Method Using Model Compensation Based on Speech Enhancement
Shen, Guanghu
Jung, Ho-Youl
Chung, Hyun-Yeol
JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2008, 27 (04): : 191 - 199
[8] Beyond Linear Transforms: Efficient Non-linear Dynamic Adaptation for Noise Robust Speech Recognition
Rennie, Steven J.
Dognin, Pierre L.
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1305 - 1308
[9] Noise robust speech recognition using Gaussian basis functions for non-linear likelihood function approximation
Pal, C
Frey, B
Kristjansson, T
2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 405 - 408
[10] Non-linear feature extraction for robust speech recognition in stationary and non-stationary noise
Zhu, QF
Alwan, A
COMPUTER SPEECH AND LANGUAGE, 2003, 17 (04): : 381 - 402

← 1 2 3 4 5 →