Xart: Discovery of correlated arguments of n-ary relations in text

被引:6
作者
Berrahou, Soumia Lilia [1 ,2 ]
Buche, Patrice [1 ,2 ]
Dibie, Juliette [3 ]
Roche, Mathieu [1 ,4 ]
机构
[1] LIRMM, 860,Rue St Priest, F-34095 Montpellier, France
[2] INRA, UMR IATE, 2,Pl Pierre Viala, F-34060 Montpellier, France
[3] Univ Paris Saclay, INRA, AgroParisTech, UMR MIA Paris, F-75005 Paris, France
[4] CIRAD, UMR TETIS, 500,Rue JF Breton, F-34093 Montpellier, France
关键词
Information extraction; N-ary relation; Ontology; Data mining; Sequential pattern; Quantitative data; Linguistic pattern; INFORMATION; ONTOLOGY; PATTERNS; UNITS;
D O I
10.1016/j.eswa.2016.12.028
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Here we present the Xart system based on a three-step hybrid method using data mining approaches and syntactic analysis to automatically discover and extract relevant data modeled as n-ary relations in plain text. A n-ary relation links a studied object with its features considered as several arguments. We addressed the challenge of designing a novel method to handle the identification and extraction of heterogeneous arguments such as symbolic arguments, quantitative arguments composed of numbers and various measurement units. We thus developed the Xart system, which relies on a domain ontology for discovering patterns, in plain text, to identify arguments involved in n-ary relations. The discovered patterns take advantage of different ontological levels that facilitate identification of all arguments and pool them in the sought n-ary relation. (c) 2016 Elsevier Ltd. All rights reserved.
引用
收藏
页码:115 / 124
页数:10
相关论文
共 46 条
[1]  
AGRAWAL R, 1995, PROC INT CONF DATA, P3, DOI 10.1109/ICDE.1995.380415
[2]  
Agrawal R., 1994, P 20 INT C VER LARG, V1215, P487, DOI DOI 10.5555/645920.672836
[3]  
[Anonymous], INFORM PROCESSING 1, DOI DOI 10.1016/S0306-4573(00)00015-7
[4]  
Bechet Nicolas, 2012, Computational Linguistics and Intelligent Text Processing. Proceedings 13th International Conference (CICLing 2012), P154, DOI 10.1007/978-3-642-28604-9_13
[5]  
Berrahou S. L., 2016, P 6 INT C WEB INT MI
[6]  
Bjrne J., 2009, P BIONLP 09 SHARED T, P10
[7]  
Buche Patrice, 2013, Revue d'Intelligence Artificielle, V27, P539, DOI 10.3166/RIA.27.539-568
[8]   Fuzzy Web Data Tables Integration Guided by an Ontological and Terminological Resource [J].
Buche, Patrice ;
Dibie-Barthelemy, Juliette ;
Ibanescu, Liliana ;
Soler, Lydie .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2013, 25 (04) :805-819
[9]  
Bui Q.C., 2011, BioNLP Shared Task 2011 Workshop, P143
[10]  
Buyko E., 2009, P WORKSHOP CURRENT T, P19