Probabilistic top-down parsing and language modeling

被引:129
|
作者
Roark, B [1 ]
机构
[1] Brown Univ, Dept Cognit & Linguist Sci, Providence, RI 02912 USA
关键词
All Open Access; Hybrid Gold; Green;
D O I
10.1162/089120101750300526
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes the functioning of a broad-coverage probabilistic top-down parser, and its application to the problem of language modeling for speech recognition. The paper first introduces key notions in language modeling and probabilistic pausing, and briefly reviews some previous approaches to using syntactic structure for language modeling. A lexicalized probabilistic top-down parser is then presented, which performs very well, in terms of both the accuracy of returned parses and the efficiency with which they are found, relative to the best broad-coverage statistical parsers. A new language model that utilizes probabilistic top-down parsing is then outlined, and empirical results show that it improves upon previous work in test corpus perplexity Interpolation with a trigram model yields an exceptional improvement relative to the improvement observed by other models, demonstrating the degree to which the information captured by our parsing model is orthogonal to that captured by a trigram model. A small recognition experiment also demonstrates the utility of the model.
引用
收藏
页码:249 / 276
页数:28
相关论文
共 50 条
  • [31] Bottom-up/top-down image parsing by attribute graph grammar
    Han, F
    Zhu, SC
    TENTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS 1 AND 2, PROCEEDINGS, 2005, : 1778 - 1785
  • [32] Top-down composite modeling of bulk power systems
    Felder, FA
    IEEE TRANSACTIONS ON POWER SYSTEMS, 2005, 20 (03) : 1655 - 1656
  • [33] Top-down modeling of hierarchical biological clock mechanisms
    Nakao, Mitsuyuki
    Okayama, Hiroshi
    Karashima, Akihiro
    Katayama, Norihiro
    SLEEP AND BIOLOGICAL RHYTHMS, 2010, 8 (02) : 106 - 113
  • [34] Top-down modeling of hierarchical biological clock mechanisms
    Mitsuyuki Nakao
    Hiroshi Okayama
    Akihiro Karashima
    Norihiro Katayama
    Sleep and Biological Rhythms, 2010, 8 : 106 - 113
  • [36] Integrated modeling with top-down approach in subsidiary industries
    Aleixos, N
    Company, P
    Contero, M
    COMPUTERS IN INDUSTRY, 2004, 53 (01) : 97 - 116
  • [38] TOP-DOWN GEOMETRIC MODELING OF BUILDINGS ON NETWORK DATABASE
    CHOI, CK
    KIM, ED
    COMPUTER-AIDED DESIGN, 1993, 25 (08) : 468 - 478
  • [39] A Top-Down Modeling Approach for DEMO Magnetic System
    Boso, Daniela P.
    Giannini, Lorenzo
    Corato, Valentina
    IEEE TRANSACTIONS ON APPLIED SUPERCONDUCTIVITY, 2022, 32 (06)
  • [40] Dynamic Oracles for Top-Down and In-Order Shift-Reduce Constituent Parsing
    Fernandez-Gonzalez, Daniel
    Gomez-Rodriguez, Carlos
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 1303 - 1313