Frame-Based Ontology Population with PIKES

Cited by: 32
Authors
Corcoglioniti, Francesco [1]
Rospocher, Marco [1]
Aprosio, Alessio Palmero [1]
Affiliation
[1] Fdn Bruno Kessler, I-38122 Trento, Italy
Keywords
Ontology population; Semantic Web; natural language processing; semantic role labeling; FrameBase; KNOWLEDGE-BASE
DOI
10.1109/TKDE.2016.2602206
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Discipline Classification Codes
081104; 0812; 0835; 1405
Abstract
We present an approach for ontology population from English natural language texts that extracts RDF triples according to FrameBase, a Semantic Web ontology derived from FrameNet. Processing is decoupled into two independently tunable phases. First, text is processed by several NLP tasks, including Semantic Role Labeling (SRL), whose results are integrated in an RDF graph of mentions, i.e., snippets of text denoting some entity/fact. Then, the mention graph is processed with SPARQL-like rules using a specifically created mapping resource from NomBank/PropBank/FrameNet annotations to FrameBase concepts, producing a knowledge graph whose content is linked to DBpedia and organized around semantic frames, i.e., prototypical descriptions of events and situations. A single RDF/OWL representation is used where each triple is related to the mentions/tools it comes from. We implemented the approach in PIKES, an open source tool that combines two complementary SRL systems and provides a working online demo. We evaluated PIKES on a manually annotated gold standard, assessing precision/recall in (i) populating the FrameBase ontology, and (ii) extracting semantic frames modeled after standard predicate models, for comparison with state-of-the-art tools for the Semantic Web. We also evaluated (iii) sampled precision and execution times on a large corpus of 110K Wikipedia-like pages.
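The two-phase design described in the abstract can be illustrated with a minimal sketch. It is not PIKES code and does not use the tool's actual vocabulary or rule language: the prefixes (`m:`, `ex:`, `pb:`, `frame:`, `fe:`), the `MAPPING` table, and the `populate` function are all hypothetical names chosen for illustration, and plain Python tuples stand in for RDF triples. The mention graph plays the role of the phase-one output, and a single hand-written rule plays the role of the SPARQL-like rules in phase two.

```python
from typing import NamedTuple


class Triple(NamedTuple):
    s: str
    p: str
    o: str


# Hypothetical phase-one output: a tiny mention graph such as SRL might
# yield for "Alice bought a car". All identifiers are placeholders.
MENTION_GRAPH = {
    Triple("m:buy", "ex:predicate", "pb:buy.01"),
    Triple("m:buy", "ex:arg0", "m:Alice"),
    Triple("m:buy", "ex:arg1", "m:car"),
    Triple("m:Alice", "ex:denotes", "dbpedia:Alice"),
    Triple("m:car", "ex:denotes", "ex:car_1"),
}

# Hypothetical mapping resource: predicate sense -> (frame class, role map).
MAPPING = {
    "pb:buy.01": (
        "frame:Commerce_buy",
        {"ex:arg0": "fe:Buyer", "ex:arg1": "fe:Goods"},
    ),
}


def populate(mentions, mapping):
    """Turn predicate mentions into frame instances with role triples."""
    # Entity denoted by each mention (mention id -> entity id).
    denotes = {t.s: t.o for t in mentions if t.p == "ex:denotes"}
    knowledge = set()
    for pred in mentions:
        if pred.p != "ex:predicate" or pred.o not in mapping:
            continue
        frame_class, roles = mapping[pred.o]
        # Mint one frame instance per predicate mention.
        instance = "ex:frame_" + pred.s.split(":", 1)[1]
        knowledge.add(Triple(instance, "rdf:type", frame_class))
        # Map each argument of this predicate to a frame role.
        for arg in mentions:
            if arg.s == pred.s and arg.p in roles and arg.o in denotes:
                knowledge.add(Triple(instance, roles[arg.p], denotes[arg.o]))
    return knowledge


if __name__ == "__main__":
    for triple in sorted(populate(MENTION_GRAPH, MAPPING)):
        print(triple.s, triple.p, triple.o)
```

Keeping the mapping table separate from the rule mirrors the decoupling the abstract emphasizes: in this sketch the NLP output and the mapping can each be changed without touching the other.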
Pages: 3261-3275
Number of pages: 15