Distant bigram language modelling using maximum entropy

被引:0
|
作者
Simons, M
Ney, H
Martin, SC
机构
来源
1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS | 1997年
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we apply tile maximum entropy approach to so-called distant bigram language modelling. In addition to the usual unigram and bigram dependencies, we use distant bigram dependencies, where tile immediate predecessor word of the word position under consideration is skipped. The contributions of this paper are: (1) We analyze the computational complexity of the resulting training algorithm, i.e. the generalized iterative scaling (GIS) algorithm, and studs the details of its implementation. (2) We describe a method for handling unseen events in the maximum entropy approach; this is achieved by discounting the frequencies of observed events. (3) We study the effect of this discounting operation on the convergence of the GIS algorithm. (4) We give experimental perplexity results for a corpus from the WSJ task. By using the maximum entropy approach and the distant bigram dependencies, we are able to reduce the perplexity from 205.4 for our best conventional bigram model to 169.5.
引用
收藏
页码:787 / 790
页数:4
相关论文
共 50 条
  • [41] An Example on Modelling Conditional Higher Moments using Maximum Entropy Density with High Frequency Data
    Chan, Felix
    MODSIM 2007: INTERNATIONAL CONGRESS ON MODELLING AND SIMULATION: LAND, WATER AND ENVIRONMENTAL MANAGEMENT: INTEGRATED SYSTEMS FOR SUSTAINABILITY, 2007, : 2034 - 2040
  • [42] Inferring climatic controls of rice stem borers' spatial distributions using maximum entropy modelling
    Jalaeian, M.
    Golizadeh, A.
    Sarafrazi, A.
    Naimi, B.
    JOURNAL OF APPLIED ENTOMOLOGY, 2018, 142 (04) : 388 - 396
  • [43] Visibility-informed mapping of potential firefighter lookout locations using maximum entropy modelling
    Mistick, Katherine A.
    Campbell, Michael J.
    Dennison, Philip E.
    INTERNATIONAL JOURNAL OF WILDLAND FIRE, 2024, 33 (09)
  • [44] Modelling the Distribution of Dendrocygna java']javanica in North Sumatera, Indonesia using Maximum Entropy Approach
    Lazuardi
    Prastowo, P.
    Prasetya, E.
    Prakasa, H.
    6TH ANNUAL INTERNATIONAL SEMINAR ON TRENDS IN SCIENCE AND SCIENCE EDUCATION, 2020, 1462
  • [45] Modelling Climate Suitability for Rainfed Maize Cultivation in Kenya Using a Maximum Entropy (MaxENT) Approach
    Kogo, Benjamin Kipkemboi
    Kumar, Lalit
    Koech, Richard
    Kariyawasam, Champika S.
    AGRONOMY-BASEL, 2019, 9 (11):
  • [46] Ecological Niche Modelling Tool for Aquatic Life Population Distribution using Maximum Entropy Model
    King, Riana Joy
    Batista-Navarro, Riza
    Nicolas, Marilou
    Hilomen, Vincent
    Solano, Geoffrey
    2017 8TH INTERNATIONAL CONFERENCE ON INFORMATION, INTELLIGENCE, SYSTEMS & APPLICATIONS (IISA), 2017, : 225 - 230
  • [47] Using Dependency Grammar Features in Whole Sentence Maximum Entropy Language Model for Speech Recognition
    Ruokolainen, Teemu
    Alumaee, Tanel
    Dobrinkat, Marcus
    HUMAN LANGUAGE TECHNOLOGIES - THE BALTIC PERSPECTIVE, 2010, 219 : 73 - 79
  • [48] Solar radiation estimation modelling through the maximum entropy principle
    Razzak, Atteeq
    Khalid, Zoobia
    Rehman, Shafiq Ur
    Zahid, Muhammad Mustaqeem
    Adeel, Muhammad
    INTERNATIONAL JOURNAL OF EXERGY, 2024, 45 (1-2)
  • [49] Maximum entropy based generic filter for language model adaptation
    Yu, D
    Mahajan, M
    Mau, P
    Acero, A
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 597 - 600
  • [50] INVESTIGATION OF SAMPLING TECHNIQUES FOR MAXIMUM ENTROPY LANGUAGE MODELING TRAINING
    Chen, Xie
    Zhang, Jun
    Anastasakos, Tasos
    Alleva, Fil
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 7240 - 7244