Probabilistic context-free grammars estimated from infinite distributions

被引:5
|
作者
Corazza, Anna
Satta, Giorgio
机构
[1] Univ Naples Federico II, Dept Phys, I-80126 Naples, Italy
[2] Univ Padua, Dept Informat Engn, I-35131 Padua, Italy
关键词
D O I
10.1109/TPAMI.2007.1065
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we consider probabilistic context-free grammars, a class of generative devices that has been successfully exploited in several applications of syntactic pattern matching, especially in statistical natural language parsing. We investigate the problem of training probabilistic context-free grammars on the basis of distributions defined over an infinite set of trees or an infinite set of sentences by minimizing the cross-entropy. This problem has applications in cases of context-free approximation of distributions generated by more expressive statistical models. We show several interesting theoretical properties of probabilistic context-free grammars that are estimated in this way, including the previously unknown equivalence between the grammar cross-entropy with the input distribution and the so-called derivational entropy of the grammar itself. We discuss important consequences of these results involving the standard application of the maximum-likelihood estimator on finite tree and sentence samples, as well as other finite-state models such as Hidden Markov Models and probabilistic finite automata.
引用
收藏
页码:1379 / 1393
页数:15
相关论文
共 50 条
  • [1] Estimation of probabilistic context-free grammars
    Chi, ZY
    Geman, S
    COMPUTATIONAL LINGUISTICS, 1998, 24 (02) : 299 - 305
  • [2] Learning probabilistic context-free grammars from treebanks
    Verdú-Mas, JL
    Calera-Rubio, J
    Carrasco, RC
    PROGRESS IN PATTERN RECOGNITION, SPEECH AND IMAGE ANALYSIS, 2003, 2905 : 537 - 544
  • [3] Statistical properties of probabilistic context-free grammars
    Chi, ZY
    COMPUTATIONAL LINGUISTICS, 1999, 25 (01) : 131 - 160
  • [4] Generalized queries on probabilistic context-free grammars
    Pynadath, DV
    Wellman, MP
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1998, 20 (01) : 65 - 77
  • [5] Generalized queries on probabilistic context-free grammars
    Pynadath, DV
    Wellman, MP
    PROCEEDINGS OF THE THIRTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND THE EIGHTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE, VOLS 1 AND 2, 1996, : 1285 - 1290
  • [6] PROBABILISTIC CONTEXT-FREE GRAMMARS THAT ACHIEVE CAPACITY
    JUSTESEN, J
    LARSEN, KJ
    INFORMATION AND CONTROL, 1975, 29 (03): : 268 - 285
  • [7] Generalized context-free grammars and multiple context-free grammars
    Kasami, Tadao
    Seki, Hiroyuki
    Fujii, Mamoru
    Systems and Computers in Japan, 1989, 20 (07): : 43 - 52
  • [8] Context-Free Tree Grammars are as Powerful as Context-Free Jungle Grammars
    Drewes, Frank
    Engelfriett, Joost
    ACTA CYBERNETICA, 2015, 22 (02): : 373 - 392
  • [9] Compound Probabilistic Context-Free Grammars for Grammar Induction
    Kim, Yoon
    Dyer, Chris
    Rush, Alexander M.
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 2369 - 2385
  • [10] Squibs and Discussions: Estimation of Probabilistic Context-Free Grammars
    Division of Applied Mathematics, Brown University, Providence, RI 02912, United States
    Comput. Linguist., 2 (299-305):