Speaker adaptation through speaker specific compensation

被引：0

作者：

Laxman, S ^{[1
]}

Sastry, PS ^{[1
]}

机构：

[1] Indian Inst Sci, Dept Elect Engn, Bangalore 560012, Karnataka, India

来源：

2004 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING & COMMUNICATIONS (SPCOM) | 2004年

关键词：

D O I：

10.1109/SPCOM.2004.1458361

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

This paper describes a new speaker adaptation strategy that we term speaker specific compensation. The basic idea is to transform speech of a speaker in a way that renders it recognizable by a speaker dependent classifier built for another speaker. The compensating filter is learnt as a cepstral vector using labeled speech samples of the speaker. Using some ideas about combining multiple pattern classifiers, we present a new speaker independent speech recognition system that uses a few speaker dependent classifiers along with a bank of cepstral compensating vectors learnt for a large number of other speakers. Each of the speaker dependent classifiers is trained on the given speech samples of only one speaker and is never retrained or adapted thereafter. We present some results to illustrate the effectiveness of this speaker specific compensation idea.

引用

页码：81 / 85

页数：5

共 11 条

[1] The Metamorphic Algorithm: A Speaker Mapping Approach to Data Augmentation
Bellegarda, Jerome R.
de Souza, Peter V.
Nadas, Arthur
Nahamoo, David
Picheny, Michael A.
Bahl, Lalit R.
[J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (03): : 413 - 420
[2] On combining classifiers
Kittler, J
Hatef, M
Duin, RPW
Matas, J
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1998, 20 (03) : 226 - 239
[3] KUBALA F, 1990, INT CONF ACOUST SPEE, P137, DOI 10.1109/ICASSP.1990.115557
[4] Rapid speaker adaptation in eigenvoice space
Kuhn, R
Junqua, JC
Nguyen, P
Niedzielski, N
[J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2000, 8 (06): : 695 - 707
[5] LAXMAN S, 2003, TENCON 2003
[6] LAXMAN S, 2002, THESIS INDIAN I SCI
[7] A STUDY ON SPEAKER ADAPTATION OF THE PARAMETERS OF CONTINUOUS DENSITY HIDDEN MARKOV-MODELS
LEE, CH
LIN, CH
JUANG, BH
[J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1991, 39 (04) : 806 - 814
[8] A frequency warping approach to speaker normalization
Lee, L
Rose, R
[J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1998, 6 (01): : 49 - 60
[9] Improved speech recognition using a subspace projection approach
Loizou, PC
Spanias, AS
[J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1999, 7 (03): : 343 - 345
[10] Two timescale analysis of the Alopex algorithm for optimization
Sastry, PS
Magesh, M
Unnikrishnan, KP
[J]. NEURAL COMPUTATION, 2002, 14 (11) : 2729 - 2750

← 1 2 →