Extracting Structured Scholarly Information from the Machine Translation Literature

Citations: 0
Authors
Choi, Eunsol [1, 4]
Horvat, Matic [2, 4]
May, Jonathan [3 ]
Knight, Kevin [3 ]
Marcu, Daniel [3 ]
Affiliations
[1] Univ Washington, Seattle, WA 98195 USA
[2] Univ Cambridge, Cambridge, England
[3] Informat Sci Inst, Los Angeles, CA USA
[4] ISI, Los Angeles, CA USA
Source
LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION | 2016
Keywords
Information Extraction; Scientific Literature; Structured Prediction
DOI
Not available
Chinese Library Classification
H [Language, Writing]
Subject classification code
05
Abstract
Understanding the experimental results of a scientific paper is crucial to understanding its contribution and to comparing it with related work. We introduce a structured, queryable representation for experimental results and a baseline system that automatically populates this representation. The representation can answer compositional questions such as: "Which are the best published results reported on the NIST 09 Chinese to English dataset?" and "What are the most important methods for speeding up phrase-based decoding?" Answering such questions usually involves lengthy literature surveys. Current machine reading for academic papers does not usually consider the actual experiments, but mostly focuses on understanding abstracts. We describe annotation work to create an initial ⟨scientific paper; experimental results representation⟩ corpus. The corpus is composed of 67 papers which were manually annotated with a structured representation of experimental results by domain experts. Additionally, we present a baseline algorithm that characterizes the difficulty of the inference task.
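To make the idea of a structured, queryable representation concrete, here is a minimal sketch of how the first example question might be answered over such records. The schema (`ExperimentResult` and its fields), the `best_results` helper, and all record values are hypothetical placeholders chosen for illustration; they are not the representation or data used in the paper. The sketch also assumes that a higher score is better for the chosen metric.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class ExperimentResult:
    """One reported experimental result (hypothetical schema, not the paper's)."""
    paper_id: str       # placeholder identifier, not a real citation
    dataset: str        # e.g. "NIST 09"
    language_pair: str  # e.g. "zh-en"
    method: str         # short label for the method evaluated
    metric: str         # e.g. "BLEU"
    score: float        # reported value for the metric


def best_results(
    results: List[ExperimentResult],
    dataset: str,
    language_pair: str,
    metric: str,
    top_k: int = 1,
) -> List[ExperimentResult]:
    """Return the top_k matching results, assuming higher scores are better."""
    matching = [
        r for r in results
        if r.dataset == dataset
        and r.language_pair == language_pair
        and r.metric == metric
    ]
    return sorted(matching, key=lambda r: r.score, reverse=True)[:top_k]


# Made-up placeholder records, for illustration only.
RESULTS = [
    ExperimentResult("paper-A", "NIST 09", "zh-en", "baseline", "BLEU", 30.0),
    ExperimentResult("paper-B", "NIST 09", "zh-en", "method-X", "BLEU", 32.5),
    ExperimentResult("paper-C", "NIST 08", "zh-en", "method-Y", "BLEU", 35.0),
]

if __name__ == "__main__":
    for r in best_results(RESULTS, "NIST 09", "zh-en", "BLEU"):
        print(r.paper_id, r.method, r.score)
```

The query composes three filters (dataset, language pair, metric) with a ranking step, which is the kind of compositional question the abstract describes.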
Pages: 421-425
Page count: 5