Extracting Structured Scholarly Information from the Machine Translation Literature

被引:0
作者
Choi, Eunsol [1 ,4 ]
Horvat, Matic [2 ,4 ]
May, Jonathan [3 ]
Knight, Kevin [3 ]
Marcu, Daniel [3 ]
机构
[1] Univ Washington, Seattle, WA 98195 USA
[2] Univ Cambridge, Cambridge, England
[3] Informat Sci Inst, Los Angeles, CA USA
[4] ISI, Los Angeles, CA USA
来源
LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION | 2016年
关键词
Information Extraction; Scientific Literature; Structured Prediction;
D O I
暂无
中图分类号
H [语言、文字];
学科分类号
05 ;
摘要
Understanding the experimental results of a scientific paper is crucial to understanding its contribution and to comparing it with related work. We introduce a structured, queryable representation for experimental results and a baseline system that automatically populates this representation. The representation can answer compositional questions such as: "Which are the best published results reported on the NIST 09 Chinese to English dataset?" and "What are the most important methods for speeding up phrase-based decoding?" Answering such questions usually involves lengthy literature surveys. Current machine reading for academic papers does not usually consider the actual experiments, but mostly focuses on understanding abstracts. We describe annotation work to create an initial hscientific paper; experimental results representationi corpus. The corpus is composed of 67 papers which were manually annotated with a structured representation of experimental results by domain experts. Additionally, we present a baseline algorithm that characterizes the difficulty of the inference task.
引用
收藏
页码:421 / 425
页数:5
相关论文
共 50 条
[41]   Extracting information about proper nouns from Arabic newspaper text [J].
Abuleil, S ;
Evens, M .
COMPUTERS AND THEIR APPLICATIONS, 2001, :374-378
[42]   Matching Graph, a Method for Extracting Parallel Information from Comparable Corpora [J].
Bakhshaei, Somayeh ;
Safabakhsh, Reza ;
Khadivi, Shahram .
ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2020, 19 (01)
[43]   Semantic Rules for Extracting Proteins Functions Information from Biomedical Abstracts [J].
Taha, Kamal .
PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2015, :595-598
[44]   From Codebooks to Promptbooks: Extracting Information from Text with Generative Large Language Models [J].
Stuhler, Oscar ;
Ton, Cat Dang ;
Ollion, Etienne .
SOCIOLOGICAL METHODS & RESEARCH, 2025, 54 (03) :794-848
[45]   Pipelines for Procedural Information Extraction from Scientific Literature: Towards Recipes using Machine Learning and Data Science [J].
Yang, Huichen ;
Aguirre, Carlos A. ;
De La Torre, Maria F. ;
Christensen, Derek ;
Bobadilla, Luis ;
Davich, Emily ;
Roth, Jordan ;
Luo, Lei ;
Theis, Yihong ;
Lam, Alice ;
Han, T. Yong-Jin ;
Buttler, David ;
Hsu, William H. .
2019 INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION WORKSHOPS (ICDARW) AND 2ND INTERNATIONAL WORKSHOP ON OPEN SERVICES AND TOOLS FOR DOCUMENT ANALYSIS (OST), VOL 2, 2019, :41-46
[46]   ON AUTOMATICALLY EXTRACTING THE STATISTICAL INFORMATION OF PORES IN HETEROGENEOUS MATERIALS FROM THE SEM MORPHOLOGY [J].
Li, Haolin ;
Dong, Shuhao ;
Qin, Na ;
Liu, Jiantao ;
Yu, Yaoxiang ;
Zhang, Zhengqing ;
Wu, Muqing ;
Chen, Zhu .
JOURNAL OF POROUS MEDIA, 2021, 24 (08) :83-100
[47]   A method for extracting task-oriented information from biological text sources [J].
Kuttiyapillai, Dhanasekaran ;
Rajeswari, R. .
INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2015, 12 (04) :387-399
[48]   Using contextual information for extracting air target behaviour from sensor tracks [J].
Oxenham, MG .
SIGNAL PROCESSING, SENSOR FUSION, AND TARGET RECOGNITION XII, 2003, 5096 :482-493
[49]   Beyond supervised learning of wrappers for extracting information from unseen Web sites [J].
Wong, TL ;
Lam, W ;
Wang, W .
INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING, 2003, 2690 :725-733
[50]   Tree pattern expression for extracting information from syntactically parsed text corpora [J].
Yong Suk Choi .
Data Mining and Knowledge Discovery, 2011, 22 :211-231