Speaker adaptation through speaker specific compensation

被引:0
作者
Laxman, S [1 ]
Sastry, PS [1 ]
机构
[1] Indian Inst Sci, Dept Elect Engn, Bangalore 560012, Karnataka, India
来源
2004 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING & COMMUNICATIONS (SPCOM) | 2004年
关键词
D O I
10.1109/SPCOM.2004.1458361
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper describes a new speaker adaptation strategy that we term speaker specific compensation. The basic idea is to transform speech of a speaker in a way that renders it recognizable by a speaker dependent classifier built for another speaker. The compensating filter is learnt as a cepstral vector using labeled speech samples of the speaker. Using some ideas about combining multiple pattern classifiers, we present a new speaker independent speech recognition system that uses a few speaker dependent classifiers along with a bank of cepstral compensating vectors learnt for a large number of other speakers. Each of the speaker dependent classifiers is trained on the given speech samples of only one speaker and is never retrained or adapted thereafter. We present some results to illustrate the effectiveness of this speaker specific compensation idea.
引用
收藏
页码:81 / 85
页数:5
相关论文
共 11 条
  • [1] The Metamorphic Algorithm: A Speaker Mapping Approach to Data Augmentation
    Bellegarda, Jerome R.
    de Souza, Peter V.
    Nadas, Arthur
    Nahamoo, David
    Picheny, Michael A.
    Bahl, Lalit R.
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (03): : 413 - 420
  • [2] On combining classifiers
    Kittler, J
    Hatef, M
    Duin, RPW
    Matas, J
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1998, 20 (03) : 226 - 239
  • [3] KUBALA F, 1990, INT CONF ACOUST SPEE, P137, DOI 10.1109/ICASSP.1990.115557
  • [4] Rapid speaker adaptation in eigenvoice space
    Kuhn, R
    Junqua, JC
    Nguyen, P
    Niedzielski, N
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2000, 8 (06): : 695 - 707
  • [5] LAXMAN S, 2003, TENCON 2003
  • [6] LAXMAN S, 2002, THESIS INDIAN I SCI
  • [7] A STUDY ON SPEAKER ADAPTATION OF THE PARAMETERS OF CONTINUOUS DENSITY HIDDEN MARKOV-MODELS
    LEE, CH
    LIN, CH
    JUANG, BH
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1991, 39 (04) : 806 - 814
  • [8] A frequency warping approach to speaker normalization
    Lee, L
    Rose, R
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1998, 6 (01): : 49 - 60
  • [9] Improved speech recognition using a subspace projection approach
    Loizou, PC
    Spanias, AS
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1999, 7 (03): : 343 - 345
  • [10] Two timescale analysis of the Alopex algorithm for optimization
    Sastry, PS
    Magesh, M
    Unnikrishnan, KP
    [J]. NEURAL COMPUTATION, 2002, 14 (11) : 2729 - 2750