Automatic Selection of HPSG-Parsed Sentences for Treebank Construction

被引:0
|
作者
Marimon, Montserrat [1 ]
Bel, Nuria [2 ]
Padro, Lluis [3 ]
机构
[1] Univ Barcelona, E-08007 Barcelona, Spain
[2] Univ Pompeu Fabra, Barcelona, Spain
[3] Univ Politecn Cataluna, E-08028 Barcelona, Spain
关键词
D O I
10.1162/COLI_a_00190
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This article presents an ensemble parse approach to detecting and selecting high-quality linguistic analyses output by a hand-crafted HPSG grammar of Spanish implemented in the LKB system. The approach uses full agreement (i.e., exact syntactic match) along with a MaxEnt parse selection model and a statistical dependency parser trained on the same data. The ultimate goal is to develop a hybrid corpus annotation methodology that combines fully automatic annotation and manual parse selection, in order to make the annotation task more efficient while maintaining high accuracy and the high degree of consistency necessary for any foreseen uses of a treebank.
引用
收藏
页码:523 / 531
页数:9
相关论文
共 15 条
  • [1] Semantic annotation of verb arguments in shallow parsed polish sentences by means of the EM selection algorithm
    Hajnicz, Elżbieta
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2009, 5070 LNCS : 211 - 240
  • [2] A Study on the Automatic Selection of Candidate Sentences and Distractors
    Aldabe, Itziar
    Maritxalar, Montse
    Mitkov, Ruslan
    ARTIFICIAL INTELLIGENCE IN EDUCATION: BUILDING LEARNING SYSTEMS THAT CARE: FROM KNOWLEDGE REPRESENTATION TO AFFECTIVE MODELLING, 2009, 200 : 656 - +
  • [3] CF Planter: A Toolset for Semi-automatic Thai Treebank Construction
    Seenual, Pechlada
    Chay-intr, Thodsaporn
    Theeramunkong, Thanaruk
    2018 INTERNATIONAL CONFERENCE ON EMBEDDED SYSTEMS AND INTELLIGENT TECHNOLOGY & INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY FOR EMBEDDED SYSTEMS (ICESIT-ICICTES), 2018,
  • [4] Automatic selection of informative sentences: The sentences that can generate multiple choice questions
    Majumder, Mukta
    Saha, Sujan Kumar
    KNOWLEDGE MANAGEMENT & E-LEARNING-AN INTERNATIONAL JOURNAL, 2014, 6 (04) : 377 - 391
  • [5] An automatic text summarization based on valuable sentences selection
    Mahalleh E.R.
    Gharehchopogh F.S.
    International Journal of Information Technology, 2022, 14 (6) : 2963 - 2969
  • [6] AUTOMATIC SELECTION OF THE SUBJECT BEARING SENTENCES FROM ABSTRACTS
    HARADA, T
    MARUYAMA, H
    SATOH, M
    HOSONO, K
    MOROHASHI, M
    LIBRARY AND INFORMATION SCIENCE, 1991, (29): : 125 - 137
  • [7] ATOB algorithm: an automatic ontology construction for Thai legal sentences retrieval
    Boonchom, Vi-sit
    Soonthornphisaj, Nuanwan
    JOURNAL OF INFORMATION SCIENCE, 2012, 38 (01) : 37 - 51
  • [8] A Novel Approach for Construction of Sentences for Automatic Story Generation Using Ontology
    Jaya, A.
    Uma, G. V.
    ICCN: 2008 INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING, 2008, : 102 - 105
  • [9] RECURSIVE AUTOMATIC BIAS SELECTION FOR CLASSIFIER CONSTRUCTION
    BRODLEY, CE
    MACHINE LEARNING, 1995, 20 (1-2) : 63 - 94
  • [10] AUTOMATIC SELECTION AND ANALYSIS OF VERB AND ADJECTIVE SYNONYMS FROM JAPANESE SENTENCES USING MACHINE LEARNING
    Murata, Masaki
    Orikane, Kazuki
    Akae, Ryota
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2019, 15 (06): : 2135 - 2147