A hybrid ontology-based information extraction system

Cited by: 15
Authors
Gutierrez, Fernando [1]
Dou, Dejing [1]
Fickas, Stephen [1]
Wimalasuriya, Daya [2]
Zong, Hui [3]
Affiliations
[1] Univ Oregon, Eugene, OR 97403 USA
[2] Univ Moratuwa, Moratuwa, Sri Lanka
[3] Univ Virginia, Charlottesville, VA 22903 USA
Funding
National Science Foundation (USA)
Keywords
Ensemble learning; error detection; information extraction; machine learning; ontology; retrieval; web
DOI
10.1177/0165551515610989
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Information Extraction is the process of automatically obtaining knowledge from plain text. Because of the ambiguity of written natural language, Information Extraction is a difficult task. Ontology-based Information Extraction (OBIE) reduces this complexity by including contextual information in the form of a domain ontology. The ontology guides the extraction process by providing concepts and relationships about the domain. However, OBIE systems have not been widely adopted because of the difficulties in deployment and maintenance. The Ontology-based Components for Information Extraction (OBCIE) architecture has been proposed as a way to encourage the adoption of OBIE by promoting reusability through modularity. In this paper, we propose two orthogonal extensions to OBCIE that allow the construction of hybrid OBIE systems with higher extraction accuracy and a new functionality. The first extension utilizes OBCIE modularity to integrate different types of implementation into one extraction system, producing a more accurate extraction. For each concept or relationship in the ontology, we can select the best implementation for extraction, or we can combine both implementations under an ensemble learning schema. The second extension is a novel ontology-based error detection mechanism. Following a heuristic approach, we can identify sentences that are logically inconsistent with the domain ontology. Because the implementation strategy for the extraction of a concept is independent of the functionality of the extraction, we can design a hybrid OBIE system with concepts utilizing different implementation strategies for extracting correct or incorrect sentences. Our evaluation shows that, in the implementation extension, our proposed method is more accurate in terms of correctness and completeness of the extraction. Moreover, our error detection method can identify incorrect statements with high accuracy.
Pages: 798-820
Page count: 23
Related papers
50 in total
  • [31] Ontology-based automated information extraction from building energy conservation codes
    Zhou, Peng
    El-Gohary, Nora
    AUTOMATION IN CONSTRUCTION, 2017, 74 : 103 - 117
  • [32] Ontology-Based Information Extraction for Subject-Focussed Automatic Essay Evaluation
    Ajetunmobi, Stephanie Abimbola
    Daramola, Olawande
    PROCEEDINGS OF THE IEEE INTERNATIONAL CONFERENCE ON COMPUTING NETWORKING AND INFORMATICS (ICCNI 2017), 2017
  • [33] Ontology-based information extraction: An introduction and a survey of current approaches
    Wimalasuriya, Daya C.
    Dou, Dejing
    JOURNAL OF INFORMATION SCIENCE, 2010, 36 (03) : 306 - 323
  • [34] Ontology-based interactive information extraction from scientific abstracts
    Milward, D
    Bjäreland, M
    Hayes, W
    Maxwell, M
    Öberg, L
    Tilford, N
    Thomas, J
    Hale, R
    Knight, S
    Barnes, JE
    COMPARATIVE AND FUNCTIONAL GENOMICS, 2005, 6 (1-2) : 67 - 71
  • [35] ONTOLOGY-BASED INFORMATION EXTRACTION FROM PDF DOCUMENTS WITH XONTO
    Oro, Ermelinda
    Ruffolo, Massimo
    Sacca, Domenico
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2009, 18 (05) : 673 - 695
  • [36] Ontology-Based Information Extraction and Reasoning for Business Intelligence Applications
    Declerck, Thierry
    Federmann, Christian
    Kiefer, Bernd
    Krieger, Hans-Ulrich
    KI 2008: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2008, 5243 : 389 - 390
  • [37] Towards ontology-based information extraction in distributed manufacturing systems
    Li, B. X.
    Yang, L.
    Ong, S. K.
    Lei, Y.
    Nee, A. Y. C.
    INNOVATIVE DEVELOPMENTS IN DESIGN AND MANUFACTURING: ADVANCED RESEARCH IN VIRTUAL AND RAPID PROTOTYPING, 2010 : 483 - 488
  • [38] Evaluation of a Fuzzy Ontology-Based Medical Information System
    Parry, David
    INTERNATIONAL JOURNAL OF HEALTHCARE INFORMATION SYSTEMS AND INFORMATICS, 2006, 1 (01) : 40 - 51
  • [39] An extensible, ontology-based, distributed information system architecture
    Chao, AI
    Krikeles, BC
    Lusignan, AE
    Starczewski, E
    FUSION 2003: PROCEEDINGS OF THE SIXTH INTERNATIONAL CONFERENCE OF INFORMATION FUSION, VOLS 1 AND 2, 2003 : 642 - 649
  • [40] Ontology-based information extraction for juridical events with case studies in Brazilian legal realm
    de Araujo D.A.
    Rigo S.J.
    Barbosa J.L.V.
    Artificial Intelligence and Law, 2017, 25 (4) : 379 - 396