Association pattern language modelling

被引:29
作者
Chien, Jen-Tzung [1 ]
机构
[1] Natl Cheng Kung Univ, Dept Comp Sci & Informat Engn, Tainan 70101, Taiwan
来源
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2006年 / 14卷 / 05期
关键词
associaiion pattern; data mining; language model; long distance association; maximum entropy and trigger pair;
D O I
10.1109/TSA.2005.858551
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Statistical n-gram language modeling is popular for speech recognition and many other applications. The conventional n-gram suffers from the insufficiency of modeling long-distance language dependencies,: This paper presents a novel approach focusing on mining long distance word associations and incorporating these features into language models based on linear interpolation and maximum entry (ME) principles. We highlight the discovery of the associations of multiple distant. words from training corpus. A mining algorithm is exploited to, recursively merge the frequent word subsets and efficiently construct the set of association patterns. By, combining the features of association patterns into n-gram models, the association pattern n-grams are estimated with a special realization to trigger pair n-gram where only the associations of two distant Words are considered. In the experiments on Chinese language modeling, we find that the incorporation of association patterns significantly reduces models The incorporation using ME, outperforms that using linear interpolation. Association, pattern n-gram is superior to trigger pair,n-gram. The perplexities are further reduced using more association steps. Further, the proposed association pattern n-grams,are not only able to elevate document classification accuracies but also improve speech recognition. rates.
引用
收藏
页码:1719 / 1728
页数:10
相关论文
共 25 条
  • [1] Agrawal R, 1994, P 20 INT C VER LARG, V1215, P487
  • [2] Exploiting latent semantic information in statistical language modeling
    Bellegarda, JR
    [J]. PROCEEDINGS OF THE IEEE, 2000, 88 (08) : 1279 - 1296
  • [3] An empirical study of smoothing techniques for language modeling
    Chen, SF
    Goodman, J
    [J]. COMPUTER SPEECH AND LANGUAGE, 1999, 13 (04) : 359 - 394
  • [4] Predictive hidden Markov model selection for speech recognition
    Chien, JT
    Furui, S
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (03): : 377 - 387
  • [5] Decision tree state tying using cluster validity criteria
    Chien, JT
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (02): : 182 - 193
  • [6] CHIEN JT, 2004, P INT C LANGUAGE PR, V2, P1373
  • [7] CHIEN JT, 2004, P INT C SPOK LANG PR, V2, P1369
  • [8] Chueh CH, 2004, 2004 International Symposium on Chinese Spoken Language Processing, Proceedings, P309
  • [9] GENERALIZED ITERATIVE SCALING FOR LOG-LINEAR MODELS
    DARROCH, JN
    RATCLIFF, D
    [J]. ANNALS OF MATHEMATICAL STATISTICS, 1972, 43 (05): : 1470 - &
  • [10] DELLAPIETRA S, 1992, P IEEE INT C ACOUSTI