Online Speaker Clustering Using Incremental Learning of an Ergodic Hidden Markov Model

被引:2
|
作者
Koshinaka, Takafumi [1 ,2 ]
Nagatomo, Kentaro [1 ]
Shinoda, Koichi [2 ]
机构
[1] NEC Corp Ltd, Informat & Media Proc Labs, Kawasaki, Kanagawa 2118666, Japan
[2] Tokyo Inst Technol, Dept Comp Sci, Tokyo 1528552, Japan
来源
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS | 2012年 / E95D卷 / 10期
关键词
HMM; model selection; meeting recognition; variational Bayesian learning; ALGORITHM; MIXTURE;
D O I
10.1587/transinf.E95.D.2469
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A novel online speaker clustering method based on a generative model is proposed. It employs an incremental variant of variational Bayesian learning and provides probabilistic (non-deterministic) decisions for each input utterance, on the basis of the history of preceding utterances. It can be expected to be robust against errors in cluster estimation and the classification of utterances, and hence to be applicable to many real-time applications. Experimental results show that it produces 50% fewer classification errors than does a conventional online method. They also show that it is possible to reduce the number of speech recognition errors by combining the method with unsupervised speaker adaptation.
引用
收藏
页码:2469 / 2478
页数:10
相关论文
共 50 条
  • [21] Improved Bayesian learning of hidden Markov models for speaker adaptation
    Chien, JT
    Lee, CH
    Wang, HC
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1027 - 1030
  • [22] Ultra low bit rate speech coding using an ergodic hidden Markov model
    Lee, ME
    Durey, AS
    Moore, E
    Clements, M
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 765 - 768
  • [23] Clustering with Hidden Markov Model on Variable Blocks
    Lin, Lin
    Li, Jia
    JOURNAL OF MACHINE LEARNING RESEARCH, 2017, 18
  • [24] Improving clustering with hidden Markov models using Bayesian model selection
    Li, C
    Biswas, G
    SMC 2000 CONFERENCE PROCEEDINGS: 2000 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN & CYBERNETICS, VOL 1-5, 2000, : 194 - 199
  • [25] Subspace distribution clustering hidden Markov model
    Bocchieri, E
    Mak, BKW
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (03): : 264 - 275
  • [26] ONLINE LEARNING FOR BETA-LIOUVILLE HIDDEN MARKOV MODELS: INCREMENTAL VARIATIONAL LEARNING FOR VIDEO SURVEILLANCE AND ACTION RECOGNITION
    Ali, Samr
    Bouguila, Nizar
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 3279 - 3283
  • [27] Geological risk prediction under uncertainty in tunnel excavation using online learning and hidden Markov model
    Zhang, Limao
    Wang, Ying
    Fu, Xianlei
    Song, Xieqing
    Lin, Penghui
    FRONTIERS OF ENGINEERING MANAGEMENT, 2024,
  • [28] Pattern classification with incremental class learning and Hidden Markov Models
    Lukaszewski, F
    Nagórko, K
    5th International Conference on Intelligent Systems Design and Applications, Proceedings, 2005, : 216 - 221
  • [29] Online Farsi Handwritten Character Recognition Using Hidden Markov Model
    Ghods, Vahid
    Sohrabi, Mohammad Karim
    JOURNAL OF COMPUTERS, 2016, 11 (02) : 169 - 175
  • [30] Speaker-independent Thai polysyllabic word recognition using hidden Markov model
    Ahkuputra, V
    Jitapunkul, S
    Pornsukchandra, W
    Luksaneeyanawin, S
    1997 IEEE PACIFIC RIM CONFERENCE ON COMMUNICATIONS, COMPUTERS AND SIGNAL PROCESSING, VOLS 1 AND 2: PACRIM 10 YEARS - 1987-1997, 1997, : 593 - 599