SYNTACTIC SIMPLIFICATION AND SEMANTIC ENRICHMENT: TRIMMING DEPENDENCY GRAPHS FOR EVENT EXTRACTION

Cited by: 11
Authors
Buyko, Ekaterina [1]
Faessler, Erik [1]
Wermter, Joachim [1]
Hahn, Udo [1]
Affiliations
[1] Univ Jena, Jena Univ Language & Informat Engn JULIE Lab, D-07743 Jena, Germany
Keywords
biomedical natural language processing; dependency parsing; event extraction; relation extraction; sentence compression; information; corpus; text
DOI
10.1111/j.1467-8640.2011.00402.x
Chinese Library Classification number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In our approach to event extraction, dependency graphs constitute the fundamental data structure for knowledge capture. Two types of trimming operations pave the way to more effective relation extraction. First, we simplify the syntactic representation structures resulting from parsing by pruning informationally irrelevant lexical material from dependency graphs. Second, we enrich informationally relevant lexical material in the simplified dependency graphs with additional semantic metadata at several layers of conceptual granularity. These two aggregation operations on linguistic representation structures are intended to avoid overfitting of the machine learning-based classifiers that we use for event extraction (alongside manually curated dictionaries). Given this methodological framework, the corresponding JReX system developed by the JulieLab Team from Friedrich-Schiller-Universität Jena (Germany) ranked 2nd among 24 competing teams for Task 1 in the BioNLP09 Shared Task on Event Extraction, with 45.8% recall, 47.5% precision, and 46.7% F1-score on all 3,182 events. In more recent experiments, based on slight modifications of JReX and using the same data sets, we were able to achieve 45.9% recall, 57.7% precision, and 51.1% F1-score.
Pages: 610-644
Page count: 35