DptOIE: a Portuguese open information extraction based on dependency analysis

被引:0
|
作者
Leandro Oliveira
Daniela Barreiro Claro
Marlo Souza
机构
[1] Federal University of Bahia (UFBA)–Institute of Computing (IC),FORMAS Research Group
来源
Artificial Intelligence Review | 2023年 / 56卷
关键词
Open information extraction; Dependency analysis; Few-resource languages;
D O I
暂无
中图分类号
学科分类号
摘要
It is estimated that more than 80% of the information on the Web is stored in textual form. As such, it has become increasingly difficult for humans to sort and extract useful information from the daily influx of data. In order to automate this process, open information extraction (OIE) methods have been proposed, which can extract facts from large textual bases. While most OIE methods were initially developed for the English language, the importance of developing methods for other languages, such as Portuguese, has been increasingly recognized in recent literature. OIE methods based on hand-crafted rules and shallow syntactic analysis have achieved good performances for the English language. Nevertheless, methods based on similar approaches in the Portuguese language have not achieved equivalent success. We believe that the shallow syntactic patterns previously explored in the literature do not cover important aspects of the Portuguese language syntax. For this reason, we propose the DptOIE method based on a new set of syntax-based rules using dependency parsers and a depth-first search (DFS) algorithm for OIE and a set of grammar-based rules to cover specific syntactic phenomena of the language. DptOIE was compared against the state-of-the-art OIE for the Portuguese language, obtaining favorable results both in our empirical evaluation and at the IberLEF evaluation track of OIE systems for the Portuguese language. Furthermore, we believe our method can be easily adapted to other Romance languages related to Portuguese.
引用
收藏
页码:7015 / 7046
页数:31
相关论文
共 50 条
  • [1] DptOIE: a Portuguese open information extraction based on dependency analysis
    Oliveira, Leandro
    Claro, Daniela Barreiro
    Souza, Marlo
    ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (07) : 7015 - 7046
  • [2] PragmaticOIE: a pragmatic open information extraction for Portuguese language
    Cleiton Fernando Lima Sena
    Daniela Barreiro Claro
    Knowledge and Information Systems, 2020, 62 : 3811 - 3836
  • [3] InferPortOIE: A Portuguese Open Information Extraction system with inferences
    Lima Sena, Cleiton Fernando
    Claro, Daniela Barreiro
    NATURAL LANGUAGE ENGINEERING, 2019, 25 (02) : 287 - 306
  • [4] PragmaticOIE: a pragmatic open information extraction for Portuguese language
    Sena, Cleiton Fernando Lima
    Claro, Daniela Barreiro
    KNOWLEDGE AND INFORMATION SYSTEMS, 2020, 62 (09) : 3811 - 3836
  • [5] Challenges of an Annotation Task for Open Information Extraction in Portuguese
    Glauber, Rafael
    de Oliveira, Leandro Souza
    Lima Sena, Cleiton Fernando
    Claro, Daniela Barreiro
    Souza, Marlo
    COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROPOR 2018, 2018, 11122 : 66 - 76
  • [6] Inference Approach to Enhance a Portuguese Open Information Extraction
    Lima Sena, Cleiton Fernando
    Glauber, Rafael
    Claro, Daniela Barreiro
    ICEIS: PROCEEDINGS OF THE 19TH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS - VOL 1, 2017, : 442 - 451
  • [7] PortNOIE: A Neural Framework for Open Information Extraction for the Portuguese Language
    Cabral, Bruno
    Souza, Marlo
    Claro, Daniela Barreiro
    COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROPOR 2022, 2022, 13208 : 243 - 255
  • [8] Task-Oriented Evaluation of Dependency Parsing with Open Information Extraction
    Gamallo, Pablo
    Garcia, Marcos
    COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROPOR 2018, 2018, 11122 : 77 - 82
  • [9] Open information extraction based on lexical semantics
    Xavier, Clarissa Castellã
    Strube de Lima, Vera Lúcia
    Souza, Marlo
    Journal of the Brazilian Computer Society, 2015, 21 (01)
  • [10] Multi-Grained Dependency Graph Neural Network for Chinese Open Information Extraction
    Lyu, Zhiheng
    Shi, Kaijie
    Li, Xin
    Hou, Lei
    Li, Juanzi
    Song, Binheng
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2021, PT III, 2021, 12714 : 155 - 167