Extracting Structured Scholarly Information from the Machine Translation Literature

被引:0
作者
Choi, Eunsol [1 ,4 ]
Horvat, Matic [2 ,4 ]
May, Jonathan [3 ]
Knight, Kevin [3 ]
Marcu, Daniel [3 ]
机构
[1] Univ Washington, Seattle, WA 98195 USA
[2] Univ Cambridge, Cambridge, England
[3] Informat Sci Inst, Los Angeles, CA USA
[4] ISI, Los Angeles, CA USA
来源
LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION | 2016年
关键词
Information Extraction; Scientific Literature; Structured Prediction;
D O I
暂无
中图分类号
H [语言、文字];
学科分类号
05 ;
摘要
Understanding the experimental results of a scientific paper is crucial to understanding its contribution and to comparing it with related work. We introduce a structured, queryable representation for experimental results and a baseline system that automatically populates this representation. The representation can answer compositional questions such as: "Which are the best published results reported on the NIST 09 Chinese to English dataset?" and "What are the most important methods for speeding up phrase-based decoding?" Answering such questions usually involves lengthy literature surveys. Current machine reading for academic papers does not usually consider the actual experiments, but mostly focuses on understanding abstracts. We describe annotation work to create an initial hscientific paper; experimental results representationi corpus. The corpus is composed of 67 papers which were manually annotated with a structured representation of experimental results by domain experts. Additionally, we present a baseline algorithm that characterizes the difficulty of the inference task.
引用
收藏
页码:421 / 425
页数:5
相关论文
共 50 条
[21]   Sequential data search for extracting information from texts [J].
Charnois, Thierry ;
Plantevit, Marc ;
Rigotti, Christophe ;
Cremilleux, Bruno .
TRAITEMENT AUTOMATIQUE DES LANGUES, 2009, 50 (03) :59-87
[22]   Extracting information from free text radiology reports [J].
Johnson D.B. ;
Taira R.K. ;
Cardenas A.F. ;
Aberle D.R. .
International Journal on Digital Libraries, 1997, 1 (3) :297-308
[23]   Extracting trust information from security system of a service [J].
Bahtiyar, Serif ;
Caglayan, Mehmet Ufuk .
JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2012, 35 (01) :480-490
[24]   Extracting Medication Information from French Clinical Texts [J].
Deleger, Louise ;
Grouin, Cyril ;
Zweigenbaum, Pierre .
MEDINFO 2010, PTS I AND II, 2010, 160 :949-953
[25]   Extracting Proceedings Data from Court Cases with Machine Learning [J].
Mathis, Bruno .
STATS, 2022, 5 (04) :1305-1320
[26]   Extracting Clinical Trial Design Information from MEDLINE Abstracts [J].
Kazuo Hara ;
Yuji Matsumoto .
New Generation Computing, 2007, 25 :263-275
[27]   On extracting link information of relationship instances from a web site [J].
Naing, MM ;
Lim, EP ;
Goh, DHL .
WEB SERVICES -ICWS-EUROPE 2003, PROCEEDINGS, 2003, 2853 :213-226
[28]   Extracting clinical trial design information from MEDLINE abstracts [J].
Hara, Kazuo ;
Matsumoto, Yuji .
NEW GENERATION COMPUTING, 2007, 25 (03) :263-275
[29]   Extracting Information of Future Events from Arabic Newspapers: an Overview [J].
Alruily, Meshrif ;
Alghamdi, Mohammad .
2015 IEEE 9TH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC), 2015, :444-447
[30]   Hybrid approach to extracting information from web-tables [J].
Jung, Sung-won ;
Kang, Mi-young ;
Kwon, Hyuk-chul .
COMPUTER PROCESSING OF ORIENTAL LANGUAGES, PROCEEDINGS: BEYOND THE ORIENT: THE RESEARCH CHALLENGES AHEAD, 2006, 4285 :109-+