Automatic Clustering of Part-of-speech for Vocabulary Divided PLSA Language Model

被引:0
作者
Suzuki, Motoyuki [1 ]
Kuriyama, Naoto [2 ]
Ito, Akinori [2 ]
Makino, Shozo [2 ]
机构
[1] Univ Tokushima, Tokushima 770, Japan
[2] Tohoku Univ, Grad Sch Engn, Sendai, Miyagi, Japan
来源
IEEE NLP-KE 2008: PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING | 2008年
关键词
Vocabulary divided PLSA; general/style tendency score; part-of-speech; language model; speech recognition;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
PLSA is one of the most powerful language models for adaptation to a target speech. The vocabulary divided PLSA language model (VD-PLSA) shows higher performance than the conventional PLSA model because it can be adapted to the target topic and the target speaking style individually. However, all of the vocabulary must be manually divided into three. categories (topic, speaking style, and general category). In this paper, an automatic method for clustering parts-of-speech (POS) is proposed for VD-PLSA. Several corpora with different styles are prepared, and the distance between corpora in terms of POS is calculated. The "general tendency score" and "style tendency score" for each POS are calculated based on the distance between corpora. All of the POS are divided into three categories using two scores and appropriate thresholds. Experimental results showed the proposed method formed appropriate clusters, and VD-PLSA with acquired categories gave the highest performance of all other models. We applied the VD-PLSA into large vocabulary continuous speech recognition system. VD-PLSA improved the recognition accuracy for documents with lower out-of-vocabulary ratio, while other documents were not improved or slightly descended the accuracy.
引用
收藏
页码:289 / +
页数:2
相关论文
共 10 条
[1]  
[Anonymous], 1999, CLAIMING PLACE P 15
[2]  
Federico M, 1996, ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, P240, DOI 10.1109/ICSLP.1996.607087
[3]  
GILDEA D, 1999, P EUROSPEECH, P2167
[4]  
ITO A, 2006, P WESPAC, V9
[5]   Modeling long distance dependence in language: Topic mixtures versus dynamic cache models [J].
Iyer, RM ;
Ostendorf, M .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1999, 7 (01) :30-39
[6]  
KURIYAMA N, 2006, P 3 WORKSH YEUNGN U, P16
[7]  
Lee A., 2001, Proceedings of the 7th European Conference on Speech Communication and Technology (Eurospeech 2001)
[8]  
Maekawa K., 2003, PROC ISCA IEEE WORKS
[9]  
Maekawa K., 2000, P LANGUAGE RESOURCES, V6, P1
[10]  
RUDNICKY A, 1995, P ARPA SPOK LANG TEC, P66