Extracting Structured Scholarly Information from the Machine Translation Literature

被引:0
作者
Choi, Eunsol [1 ,4 ]
Horvat, Matic [2 ,4 ]
May, Jonathan [3 ]
Knight, Kevin [3 ]
Marcu, Daniel [3 ]
机构
[1] Univ Washington, Seattle, WA 98195 USA
[2] Univ Cambridge, Cambridge, England
[3] Informat Sci Inst, Los Angeles, CA USA
[4] ISI, Los Angeles, CA USA
来源
LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION | 2016年
关键词
Information Extraction; Scientific Literature; Structured Prediction;
D O I
暂无
中图分类号
H [语言、文字];
学科分类号
05 ;
摘要
Understanding the experimental results of a scientific paper is crucial to understanding its contribution and to comparing it with related work. We introduce a structured, queryable representation for experimental results and a baseline system that automatically populates this representation. The representation can answer compositional questions such as: "Which are the best published results reported on the NIST 09 Chinese to English dataset?" and "What are the most important methods for speeding up phrase-based decoding?" Answering such questions usually involves lengthy literature surveys. Current machine reading for academic papers does not usually consider the actual experiments, but mostly focuses on understanding abstracts. We describe annotation work to create an initial hscientific paper; experimental results representationi corpus. The corpus is composed of 67 papers which were manually annotated with a structured representation of experimental results by domain experts. Additionally, we present a baseline algorithm that characterizes the difficulty of the inference task.
引用
收藏
页码:421 / 425
页数:5
相关论文
共 50 条
[31]   Learning from similarity and information extraction from structured documents [J].
Holecek, Martin .
INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2021,
[32]   Learning from similarity and information extraction from structured documents [J].
Holecek, Martin .
INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2021, 24 (03) :149-165
[33]   Learning from similarity and information extraction from structured documents [J].
Martin Holeček .
International Journal on Document Analysis and Recognition (IJDAR), 2021, 24 :149-165
[34]   Extracting Events from Web Documents for Social Media Monitoring Using Structured SVM [J].
Choi, Yoonjae ;
Ryu, Pum-Mo ;
Kim, Hyunki ;
Lee, Changki .
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2013, E96D (06) :1410-1414
[35]   The parallel corpus for information extraction based on natural language processing and machine translation [J].
He, Honghua .
EXPERT SYSTEMS, 2019, 36 (05)
[36]   Extracting information from free-text aircraft repair notes [J].
Farley, B .
AI EDAM-ARTIFICIAL INTELLIGENCE FOR ENGINEERING DESIGN ANALYSIS AND MANUFACTURING, 2001, 15 (04) :295-305
[37]   Extracting laboratory test information from paper-based reports [J].
Ming-Wei Ma ;
Xian-Shu Gao ;
Ze-Yu Zhang ;
Shi-Yu Shang ;
Ling Jin ;
Pei-Lin Liu ;
Feng Lv ;
Wei Ni ;
Yu-Chen Han ;
Hui Zong .
BMC Medical Informatics and Decision Making, 23
[38]   Extracting laboratory test information from paper-based reports [J].
Ma, Ming-Wei ;
Gao, Xian-Shu ;
Zhang, Ze-Yu ;
Shang, Shi-Yu ;
Jin, Ling ;
Liu, Pei-Lin ;
Lv, Feng ;
Ni, Wei ;
Han, Yu-Chen ;
Zong, Hui .
BMC MEDICAL INFORMATICS AND DECISION MAKING, 2023, 23 (01)
[39]   Extracting Information on Rocky Desertification from Satellite Images: A Comparative Study [J].
Pu, Junwei ;
Zhao, Xiaoqing ;
Dong, Pinliang ;
Wang, Qian ;
Yue, Qifa .
REMOTE SENSING, 2021, 13 (13)
[40]   Green Interaction for Extracting Family Information from OCR'd Books [J].
Embley, David W. ;
Nagy, George .
2018 13TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS (DAS), 2018, :127-132