Language identification with Dynamic Hidden Markov network

被引:3
作者
Markov, Konstantin [1 ]
Nakamura, Satoshi [1 ]
机构
[1] ATR, Spoken Language Commun Res Labs, Kyoto, Japan
来源
2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12 | 2008年
关键词
language identification; never-ending learning; dynamic hidden Markov network; on-line learning; bio-inspired algorithms;
D O I
10.1109/ICASSP.2008.4518589
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we describe new language identification system based on the recently developed Dynamic Hidden Markov network (DHMnet). The DHMnet is a never-ending learning system and provides high resolution model of the speech space. Speech patterns are represented by paths through the network, and these paths when properly labeled with language IDs provide efficient means to discriminate between languages. First experiments indicated that our system can work on-line and is able to deliver relatively high performance with low latency. Evaluated on three language (English, Japanese and Chinese) identification task, the system achieved identification rates of 87.3% and 89.3% for 3 and 5 seconds long speech segments respectively.
引用
收藏
页码:4233 / 4236
页数:4
相关论文
共 9 条
[1]  
[Anonymous], P INT C SPOK LANG PR
[2]  
[Anonymous], P INTERSPEECH
[3]  
MARTINETZ TM, 1994, NEURAL NETWORKS, V3, P507
[4]   The ATR multilingual speech-to-speech translation system [J].
Nakamura, S ;
Markov, K ;
Nakaiwa, H ;
Kikui, G ;
Kawai, H ;
Jitsuhiro, T ;
Zhang, JS ;
Yamamoto, H ;
Sumita, E ;
Yamamoto, S .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (02) :365-376
[5]  
Singer E, 2003, EUR C SPEECH COMM TE, P1345
[6]  
WANG L, 2007, P INT, P174
[7]  
YAN Y, 1995, P IEEE INT C AC SPEE, P3511
[8]  
Zissman M. A., 1997, P EUR, V1, P51
[9]   Comparison of four approaches to automatic language identification of telephone speech [J].
Zissman, MA .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1996, 4 (01) :31-44