PragmaticOIE: a pragmatic open information extraction for Portuguese language

被引:7
作者
Sena, Cleiton Fernando Lima [1 ]
Claro, Daniela Barreiro [1 ]
机构
[1] Univ Fed Bahia, FORMAS Res Grp, Dept Comp Sci, FORMAS LaSiD DCC IME UFBA, Ave Adhemar de Barros, BR-40170110 Salvador, BA, Brazil
关键词
Open information extraction; Relation extraction; Inference; Context; Intention; Pragmatics;
D O I
10.1007/s10115-020-01442-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Information extraction (IE) involves the extraction of useful facts from texts. IE approaches have been categorized into two types: Traditional IE and Open IE. Traditional IE recognizes a predefined set of relationships between the arguments, and it has typically been applied to specific domains. Open IE extracts relationship descriptors expressing any semantic relationship between a pair of arguments in different domains. Although a sentence can have a different meaning, given the context and intention used, a single semantic analysis does not guarantee useful extractions. Extractions depend on the context and the intention inherited in a sentence that goes beyond the semantic meaning. Thus, a pragmatic analysis enhances the set of extractions by considering the contextual and intentional aspects. As a consequence, new facts can be extracted from this set of sentences. The combination of inference, context, and intention enables the extraction of implicit facts from texts achieving a first pragmatic level. This novel approach increases the number of facts, extracting relationships from a sentence analyzing inference, context, and intention. This is the first method to analyze a first pragmatic level from a sentence within a set of Portuguese text documents. Our method was performed over a set of Portuguese text documents and outperforms the most relevant related work comparing accuracy, number of extracted facts, and minimality measures.
引用
收藏
页码:3811 / 3836
页数:26
相关论文
共 33 条
[1]  
[Anonymous], HLT
[2]  
[Anonymous], 2012, P JOINT WORKSHOP AUT
[3]  
[Anonymous], 2016, P 25 INT JOINT C ART, VVolume 2016-Janua, P4074
[4]  
Banko M, 2007, 20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P2670
[5]  
Bast Hannah, 2014, Advances in Information Retrieval. 36th European Conference on IR Research, ECIR 2014. Proceedings: LNCS 8416, P585, DOI 10.1007/978-3-319-06028-6_61
[6]   Open Information Extraction via Contextual Sentence Decomposition [J].
Bast, Hannah ;
Haussmann, Elmar .
2013 IEEE SEVENTH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC 2013), 2013, :154-159
[7]  
Bluhdorn H, 1997, REV ESTUD LING, V6, P150
[8]   Open information extraction based on lexical semantics [J].
Xavier, Clarissa Castellã ;
Strube de Lima, Vera Lúcia ;
Souza, Marlo .
Journal of the Brazilian Computer Society, 2015, 21 (01)
[9]  
Colen W, 2013, P STIL, P21
[10]  
da Costa JC, 2009, LET HOJE, V44, P12