Word-Phrase-Entity Language Models: Getting More Mileage out of N-grams

被引:0
作者
Levit, Michael [1 ]
Parthasarathy, Sarangarajan [1 ]
Chang, Shuangyu [1 ]
Stolcke, Andreas [1 ]
Dumoulin, Benoit [1 ]
机构
[1] Microsoft Corp, Redmond, WA 98052 USA
来源
15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4 | 2014年
关键词
class-based LMs; phrase-level LMs;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a modification of the traditional n-gram language modeling approach that departs from the word-level data representation and seeks to re-express the training text in terms of tokens that could be either words, common phrases or instances of one or several classes. Our iterative optimization algorithm considers alternative parses of the corpus in terms of these tokens, re-estimates token n-gram probabilities and also updates within-class distributions. In this paper, we focus on the cold start approach that only assumes availability of the word-level training corpus, as well as a number of generic class definitions. Applied to the calendar scenario in the personal assistant domain, our approach reduces word error rates by more than 13% relative to the word-only n-gram language models. Only a small fraction of these improvements can be ascribed to a larger vocabulary.
引用
收藏
页码:666 / 670
页数:5
相关论文
共 14 条
  • [1] [Anonymous], 2002, P INT
  • [2] Brown P. F., 1992, Computational Linguistics, V18, P467
  • [3] Chen S. F., 2009, Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, P468
  • [4] DELIGNE S, 1995, INT CONF ACOUST SPEE, P169, DOI 10.1109/ICASSP.1995.479391
  • [5] Kuo H. K. J., 1999, P EUR
  • [6] Mikolov T, 2010, 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, P1045
  • [7] The design principles of a weighted finite-state transducer library
    Mohri, M
    Pereira, F
    Riley, M
    [J]. THEORETICAL COMPUTER SCIENCE, 2000, 231 (01) : 17 - 32
  • [8] Olivier D. C., 1968, THESIS
  • [9] Pinto D., 2002, JCDL 2002. Proceedings of the Second ACM/IEEE-CS Joint Conference on Digital Libraries, P46, DOI 10.1145/544220.544228
  • [10] Ries K., 1996, P ICSLP PHIL PA