Online Speaker Clustering Using Incremental Learning of an Ergodic Hidden Markov Model

被引:2
作者
Koshinaka, Takafumi [1 ,2 ]
Nagatomo, Kentaro [1 ]
Shinoda, Koichi [2 ]
机构
[1] NEC Corp Ltd, Informat & Media Proc Labs, Kawasaki, Kanagawa 2118666, Japan
[2] Tokyo Inst Technol, Dept Comp Sci, Tokyo 1528552, Japan
关键词
HMM; model selection; meeting recognition; variational Bayesian learning; ALGORITHM; MIXTURE;
D O I
10.1587/transinf.E95.D.2469
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A novel online speaker clustering method based on a generative model is proposed. It employs an incremental variant of variational Bayesian learning and provides probabilistic (non-deterministic) decisions for each input utterance, on the basis of the history of preceding utterances. It can be expected to be robust against errors in cluster estimation and the classification of utterances, and hence to be applicable to many real-time applications. Experimental results show that it produces 50% fewer classification errors than does a conventional online method. They also show that it is possible to reduce the number of speech recognition errors by combining the method with unsupervised speaker adaptation.
引用
收藏
页码:2469 / 2478
页数:10
相关论文
共 50 条
[31]   Model-based margin estimation for hidden Markov model learning and generalisation [J].
Siniscalchi, Sabato Marco ;
Li, Jinyu ;
Lee, Chin-Hui .
IET SIGNAL PROCESSING, 2013, 7 (08) :704-709
[32]   Text-Independent Online Writer Identification Using Hidden Markov Models [J].
Wu, Yabei ;
Lu, Huanzhang ;
Zhang, Zhiyong .
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2017, E100D (02) :332-339
[33]   Dictionary learning for sparse representation of signals with hidden Markov model dependency [J].
Akhavan, S. ;
Baghestani, F. ;
Kazemi, P. ;
Karami, A. ;
Soltanian-Zadeh, H. .
DIGITAL SIGNAL PROCESSING, 2022, 123
[34]   Non-intrusive load monitoring using factorial hidden markov model based on adaptive density peak clustering [J].
Wu, Zhao ;
Wang, Chao ;
Peng, Wenxiong ;
Liu, Weihua ;
Zhang, Huaiqing .
ENERGY AND BUILDINGS, 2021, 244
[35]   Sentence lipreading using hidden Markov model with integrated grammar [J].
Yu, K ;
Jiang, XY ;
Bunke, H .
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2001, 15 (01) :161-176
[36]   A Novel Voice Authentication Using Hidden Markov Model (HMM) [J].
Davamani, K. Anita ;
Sangeetha, S. .
JOURNAL OF MECHANICS OF CONTINUA AND MATHEMATICAL SCIENCES, 2019, :97-105
[37]   Multistage preictal seizure analysis using Hidden Markov Model [J].
Chiu, Alan W. L. ;
Gadi, Hareesh ;
Moller, Daniel W. ;
Valiante, Taufik A. ;
Andrade, Danielle M. .
INTERNATIONAL JOURNAL OF BIOMEDICAL ENGINEERING AND TECHNOLOGY, 2012, 10 (02) :160-173
[38]   A multifocus image fusion method by using hidden Markov model [J].
Wu, Wei ;
Yang, Xiaomin ;
Pang, Yu ;
Peng, Jian ;
Jeon, Gwanggil .
OPTICS COMMUNICATIONS, 2013, 287 :63-72
[39]   Iterative receiver for synchronous CDMA using Hidden Markov Model [J].
Khan, E ;
Slock, DTM .
13TH IEEE INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS, VOL 1-5, PROCEEDINGS: SAILING THE WAVES OF THE WIRELESS OCEANS, 2002, :1531-1534
[40]   Named Entity Recognition in Hindi Using Hidden Markov Model [J].
Chopra, Deepti ;
Joshi, Nisheeth ;
Mathur, Iti .
2016 SECOND INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE & COMMUNICATION TECHNOLOGY (CICT), 2016, :581-586