A Hierarchical Bayesian Language Model based on Pitman-Yor Processes

被引:0
|
作者
Teh, Yee Whye [1 ]
机构
[1] Natl Univ Singapore, Sch Comp, Singapore 117543, Singapore
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a new hierarchical Bayesian n-gram model of natural languages. Our model makes use of a generalization of the commonly used Dirichlet distributions called Pitman-Yor processes which produce power-law distributions more closely resembling those in natural languages. We show that an approximation to the hierarchical Pitman-Yor language model recovers the exact formulation of interpolated Kneser-Ney, one of the best smoothing methods for n-gram language models. Experiments verify that our model gives cross entropy results superior to interpolated Kneser-Ney and comparable to modified Kneser-Ney.
引用
收藏
页码:985 / 992
页数:8
相关论文
共 50 条
  • [1] Hierarchical Pitman-Yor Language Model for Information Retrieval
    Momtazi, Saeedeh
    Klakow, Dietrich
    SIGIR 2010: PROCEEDINGS OF THE 33RD ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH DEVELOPMENT IN INFORMATION RETRIEVAL, 2010, : 793 - 794
  • [2] Nonparametric Bayesian topic modelling with the hierarchical Pitman-Yor processes
    Lim, Kar Wai
    Buntine, Wray
    Chen, Changyou
    Du, Lan
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2016, 78 : 172 - 191
  • [3] Hierarchical Pitman-Yor and Dirichlet Process for Language Model
    Chien, Jen-Tzung
    Chang, Ying-Lan
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 2211 - 2215
  • [4] Hierarchical Pitman-Yor language models for ASR in meetings
    Huang, Songfang
    Renals, Steve
    2007 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, VOLS 1 AND 2, 2007, : 124 - 129
  • [5] Enriched Pitman-Yor processes
    Rigon, Tommaso
    Petrone, Sonia
    Scarpa, Bruno
    SCANDINAVIAN JOURNAL OF STATISTICS, 2025,
  • [6] A Parallel Training Algorithm for Hierarchical Pitman-Yor Process Language Models
    Huang, Songfang
    Renals, Steve
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2663 - 2666
  • [7] Beta-product dependent Pitman-Yor processes for Bayesian inference
    Bassetti, Federico
    Casarin, Roberto
    Leisen, Fabrizio
    JOURNAL OF ECONOMETRICS, 2014, 180 (01) : 49 - 72
  • [8] On a Pitman-Yor problem
    Iksanov, AM
    Kim, CS
    STATISTICS & PROBABILITY LETTERS, 2004, 68 (01) : 61 - 72
  • [9] Online Learning of Concepts and Words Using Multimodal LDA and Hierarchical Pitman-Yor Language Model
    Araki, Takaya
    Nakamura, Tomoaki
    Nagai, Takayuki
    Nagasaka, Shogo
    Taniguchi, Tadahiro
    Iwahashi, Naoto
    2012 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2012, : 1623 - 1630
  • [10] Limits of renewal processes and Pitman-Yor distribution
    Basrak, Bojan
    ELECTRONIC COMMUNICATIONS IN PROBABILITY, 2015, 20