Probabilistic top-down parsing and language modeling

被引:129
|
作者
Roark, B [1 ]
机构
[1] Brown Univ, Dept Cognit & Linguist Sci, Providence, RI 02912 USA
关键词
All Open Access; Hybrid Gold; Green;
D O I
10.1162/089120101750300526
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes the functioning of a broad-coverage probabilistic top-down parser, and its application to the problem of language modeling for speech recognition. The paper first introduces key notions in language modeling and probabilistic pausing, and briefly reviews some previous approaches to using syntactic structure for language modeling. A lexicalized probabilistic top-down parser is then presented, which performs very well, in terms of both the accuracy of returned parses and the efficiency with which they are found, relative to the best broad-coverage statistical parsers. A new language model that utilizes probabilistic top-down parsing is then outlined, and empirical results show that it improves upon previous work in test corpus perplexity Interpolation with a trigram model yields an exceptional improvement relative to the improvement observed by other models, demonstrating the degree to which the information captured by our parsing model is orthogonal to that captured by a trigram model. A small recognition experiment also demonstrates the utility of the model.
引用
收藏
页码:249 / 276
页数:28
相关论文
共 50 条
  • [41] Applying the Top-down Approach to Beginners in Programming Language Education
    Saito, Daisuke
    Yamaura, Tsuneo
    2014 INTERNATIONAL CONFERENCE ON INTERACTIVE COLLABORATIVE LEARNING (ICL), 2014, : 311 - 318
  • [42] A new top-down parsing algorithm to accommodate ambiguity and left recursion in polynomial time
    Frost, Richard A.
    Hafiz, Rahmatullah
    ACM SIGPLAN NOTICES, 2006, 41 (05) : 46 - 54
  • [43] Top-down Tree Structured Decoding with Syntactic Connections for Neural Machine Translation and Parsing
    Gu, Jetic
    Shavarani, Hassan S.
    Sarkar, Anoop
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 401 - 413
  • [44] Memoizing purely functional top-down backtracking language processors
    Frost, RA
    Szydlowski, B
    SCIENCE OF COMPUTER PROGRAMMING, 1996, 27 (03) : 263 - 288
  • [45] Rule-Based Top-Down Parsing for Acyclic Contextual Hyperedge Replacement Grammars
    Drewes, Frank
    Hoffmann, Berthold
    Minas, Mark
    GRAPH TRANSFORMATION, ICGT 2021, 2021, 12741 : 164 - 184
  • [46] The top-down universe
    Minkel, JR
    NEW SCIENTIST, 2002, 175 (2355) : 28 - 31
  • [47] Top-down proteomics
    Roberts, David S.
    Loo, Joseph A.
    Tsybin, Yury O.
    Liu, Xiaowen
    Wu, Si
    Chamot-Rooke, Julia
    Agar, Jeffrey N.
    Pasa-Tolic, Ljiljana
    Smith, Lloyd M.
    Ge, Ying
    NATURE REVIEWS METHODS PRIMERS, 2024, 4 (01):
  • [48] Top-down savings
    Uzych, L
    HASTINGS CENTER REPORT, 1996, 26 (05) : 3 - 3
  • [49] Top-down research
    Greenberg, Charles B.
    CHEMICAL & ENGINEERING NEWS, 2006, 84 (45) : 3 - 4
  • [50] Top-down safety
    Keeler, A
    CHEMICAL ENGINEER-LONDON, 1999, (692): : 15 - 15