NON-LINEAR NOISE COMPENSATION FOR ROBUST SPEECH RECOGNITION USING GAUSS-NEWTON METHOD

Cited by: 0

Authors:

Zhao, Yong [1]

Juang, Biing-Hwang [1]

Affiliations:

[1] Georgia Inst Technol, Ctr Signal & Image Proc, Atlanta, GA 30332 USA

Source:

2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2011

Keywords:

Gauss-Newton method; non-linear compensation; robust speech recognition; vector Taylor series

DOI:

Not available

Chinese Library Classification:

O42 [Acoustics]

Discipline classification codes:

070206; 082403

Abstract:

In this paper, we present the Gauss-Newton method as a unified approach to optimizing non-linear noise compensation models, such as vector Taylor series (VTS), data-driven parallel model combination (DPMC), and unscented transform (UT). We demonstrate that the commonly used approaches that iteratively approximate the noise parameters in an EM framework are variants of the Gauss-Newton method. By formulating the Gauss-Newton method for estimating noise means and variances, we reduce the noise estimation problems to determining the Jacobians of the noisy speech distributions. For the sampling-based compensations, we present two methods, sample Jacobian average (SJA) and cross-covariance (XCOV), to evaluate the Jacobians. Experiments on the Aurora 2 database verify the efficacy of the Gauss-Newton method for these noise compensation models.
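To make the central idea concrete, the following is a minimal sketch of a Gauss-Newton iteration for a single scalar noise level. It is a toy illustration, not the authors' implementation. The log-spectral mismatch function y = x + log(1 + exp(n - x)), the synthetic data, and all function names are assumptions introduced here, and the sketch covers neither the EM framework nor the SJA and XCOV Jacobian estimators named in the abstract. It only shows how, once the Jacobian of the noisy-speech model with respect to the noise parameter is available, the update reduces to a linear least-squares step.

```python
import numpy as np

def mismatch(x, n):
    """Assumed log-spectral mismatch: y = x + log(1 + exp(n - x))."""
    return x + np.log1p(np.exp(n - x))

def jacobian_wrt_noise(x, n):
    """Element-wise derivative dy/dn = 1 / (1 + exp(x - n))."""
    return 1.0 / (1.0 + np.exp(x - n))

def gauss_newton_noise_level(x, y, n0=0.0, iters=50, tol=1e-12):
    """Estimate a scalar noise level n by minimizing sum((y - f(x, n))**2)."""
    n = float(n0)
    for _ in range(iters):
        r = y - mismatch(x, n)        # residuals at the current estimate
        J = jacobian_wrt_noise(x, n)  # Jacobian (a single column here)
        step = (J @ r) / (J @ J)      # scalar form of (J^T J)^{-1} J^T r
        n += step
        if abs(step) < tol:
            break
    return n

# Hypothetical clean log-spectral values and a known noise level.
x = np.linspace(-2.0, 6.0, 50)
true_n = 1.5
y = mismatch(x, true_n)               # synthetic, residual-free observations

n_hat = gauss_newton_noise_level(x, y)
print(abs(n_hat - true_n) < 1e-6)
```

In a multi-dimensional setting the scalar division would become a linear solve against J^T J, which is why the abstract frames noise estimation as a problem of obtaining the Jacobians.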


Pages: 4796-4799

Page count: 4
