SPEAKER ADAPTIVE KULLBACK-LEIBLER DIVERGENCE BASED HIDDEN MARKOV MODELS

被引:0
|
作者
Imseng, David [1 ]
Bourlard, Herve [1 ]
机构
[1] Idiap Res Inst, Martigny, Switzerland
来源
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2013年
关键词
Kullback-Leibler divergence; speaker adaptation; non-native speech; speech recognition;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Kullback-Leibler divergence based hidden Markov models (KL-HMM) have recently been introduced as an efficient and principled way to directly model sequences of posterior vectors to perform Automatic Speech Recognition (ASR). Through efficient feature level adaptation and parsimonious use of parameters, KL-HMM was successfully applied to accented and under-resourced speech recognition tasks. In this paper, inspired from Maximum A Posteriori (MAP) adaptation, we further boost KL-HMM performance by applying Bayesian speaker adaptation, directly applied to posterior features. This approach performs a simple, adaptive regression between phone posteriors estimated with a Multilayer Perceptron (MLP) on large amounts of speaker-independent training data, and speaker-specific phone posteriors generated by the speaker-independent MLP on very limited amount of speaker-specific adaptation data. Using Swiss French data (MediaParl), we show that such speaker adaptive KL-HMM can significantly outperform conventional adaptation techniques on non-native speech while yielding similar performance on native data.
引用
收藏
页码:7913 / 7917
页数:5
相关论文
共 50 条
  • [1] Variational Kullback-Leibler divergence for hidden Markov models
    Hershey, John R.
    Olsen, Peder A.
    Rennie, Steven J.
    2007 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, VOLS 1 AND 2, 2007, : 323 - 328
  • [2] Bounds on the Kullback-Leibler divergence rate between hidden Markov models
    Vidyasagar, M.
    PROCEEDINGS OF THE 46TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-14, 2007, : 2196 - 2201
  • [3] Upper bound Kullback-Leibler divergence for transient hidden Markov models
    Silva, Jorge
    Narayanan, Shrikanth
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2008, 56 (09) : 4176 - 4188
  • [4] Kullback-Leibler Divergence and Akaike Information Criterion in General Hidden Markov Models
    Fuh, Cheng-Der
    Kao, Chu-Lan Michael
    Pang, Tianxiao
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2024, 70 (08) : 5888 - 5909
  • [5] UPPER AND LOWER BOUNDS FOR APPROXIMATION OF THE KULLBACK-LEIBLER DIVERGENCE BETWEEN HIDDEN MARKOV MODELS
    Li, Haiyang
    Han, Jiqing
    Zheng, Tieran
    Zheng, Guibin
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7609 - 7613
  • [6] Quickest Change Detection and Kullback-Leibler Divergence for Two-State Hidden Markov Models
    Fuh, Cheng-Der
    Mei, Yajun
    2015 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2015, : 141 - 145
  • [7] Quickest Change Detection and Kullback-Leibler Divergence for Two-State Hidden Markov Models
    Fuh, Cheng-Der
    Mei, Yajun
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2015, 63 (18) : 4866 - 4878
  • [8] Dysarthric Speech Recognition Using Kullback-Leibler Divergence-based Hidden Markov Model
    Kim, Myungjong
    Wang, Jun
    Kim, Hoirin
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2671 - 2675
  • [9] Upper bound Kullback-Leibler divergence for hidden Markov models with application as discrimination measure for speech recognition
    Silva, Jorge
    Narayanan, Shrikanth
    2006 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY, VOLS 1-6, PROCEEDINGS, 2006, : 2299 - +
  • [10] The Kullback-Leibler divergence rate between Markov sources
    Rached, Z
    Alajaji, F
    Campbell, LL
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2004, 50 (05) : 917 - 921