A hybrid ontology-based information extraction system

Cited by: 15
Authors
Gutierrez, Fernando [1]
Dou, Dejing [1]
Fickas, Stephen [1]
Wimalasuriya, Daya [2]
Zong, Hui [3]
Affiliations
[1] Univ Oregon, Eugene, OR 97403 USA
[2] Univ Moratuwa, Moratuwa, Sri Lanka
[3] Univ Virginia, Charlottesville, VA 22903 USA
Funding
US National Science Foundation;
Keywords
Ensemble learning; error detection; information extraction; machine learning; ontology; RETRIEVAL; WEB;
DOI
10.1177/0165551515610989
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Information Extraction is the process of automatically obtaining knowledge from plain text. Because of the ambiguity of written natural language, Information Extraction is a difficult task. Ontology-based Information Extraction (OBIE) reduces this complexity by including contextual information in the form of a domain ontology. The ontology guides the extraction process by providing concepts and relationships about the domain. However, OBIE systems have not been widely adopted because of the difficulties in deployment and maintenance. The Ontology-based Components for Information Extraction (OBCIE) architecture has been proposed as a way to encourage the adoption of OBIE by promoting reusability through modularity. In this paper, we propose two orthogonal extensions to OBCIE that allow the construction of hybrid OBIE systems with higher extraction accuracy and a new functionality. The first extension utilizes OBCIE modularity to integrate different types of implementation into one extraction system, producing a more accurate extraction. For each concept or relationship in the ontology, we can select the best implementation for extraction, or we can combine both implementations under an ensemble learning schema. The second extension is a novel ontology-based error detection mechanism. Following a heuristic approach, we can identify sentences that are logically inconsistent with the domain ontology. Because the implementation strategy for the extraction of a concept is independent of the functionality of the extraction, we can design a hybrid OBIE system with concepts utilizing different implementation strategies for extracting correct or incorrect sentences. Our evaluation shows that, in the implementation extension, our proposed method is more accurate in terms of correctness and completeness of the extraction. Moreover, our error detection method can identify incorrect statements with high accuracy.
Pages: 798-820
Page count: 23
Related papers
50 in total
  • [21] SUSIE: Pharmaceutical CMC ontology-based information extraction for drug development using machine learning
    Mann, Vipul
    Viswanath, Shekhar
    Vaidyaraman, Shankar
    Balakrishnan, Jeya
    Venkatasubramanian, Venkat
    COMPUTERS & CHEMICAL ENGINEERING, 2023, 179
  • [22] Ontology-Based Information Extraction for Populating the Intelligent Scientific Internet Resources
    Akhmadeeva, Irina R.
    Zagorulko, Yury A.
    Mouromtsev, Dmitry I.
    KNOWLEDGE ENGINEERING AND SEMANTIC WEB, KESW 2016, 2016, 649 : 119 - 128
  • [23] Ontology-based intelligent information retrieval system
    Yang, Yue-Hua
    Du, Jun-Ping
    Ping, Yuan
    Ruan Jian Xue Bao/Journal of Software, 2015, 26 (07) : 1675 - 1687
  • [24] Ontology-based Geographic Information System for Environment
    Zhang Zeliang
    Wang Danping
    Yang Chengjia
    6TH INTERNATIONAL SYMPOSIUM OF ASIA INSTITUTE OF URBAN ENVIRONMENT: ENERGY CONSERVATION AND CARBON OFF IN ASIA CITY, 2009 : 164 - 168
  • [25] Ontology-based requirements elicitation of information system
    Zhai, Li-Li
    Zhang, Tao
    Peng, Ding-Hong
    Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2013, 19 (01) : 173 - 180
  • [26] Towards an Ontology-based Soil Information System
    Shu, Yanfeng
    Liu, Qing
    21ST INTERNATIONAL CONGRESS ON MODELLING AND SIMULATION (MODSIM2015), 2015 : 1462 - 1468
  • [27] Ontology-Based Feature Modeling for Construction Information Extraction from a Building Information Model
    Nepal, Madhav Prasad
    Staub-French, Sheryl
    Pottinger, Rachel
    Zhang, Jiemin
    JOURNAL OF COMPUTING IN CIVIL ENGINEERING, 2013, 27 (05) : 555 - 569
  • [28] Visual Ontology-based Information Retrieval System
    Zhuhadar, Leyla
    Nasraoui, Olfa
    Wyatt, Robert
    INFORMATION VISUALIZATION, IV 2009, PROCEEDINGS, 2009 : 419 - 426
  • [29] An ontology-based comparative anatomy information system
    Travillian, Ravensara S.
    Diatchka, Kremena
    Judge, Tejinder K.
    Wilamowska, Katarzyna
    Shapiro, Linda G.
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2011, 51 (01) : 1 - 15
  • [30] Ontology-Based Hazard Information Extraction from Chinese Food Complaint Documents
    Yang, Xiquan
    Gao, Rui
    Han, Zhengfu
    Sui, Xin
    ADVANCES IN SWARM INTELLIGENCE, ICSI 2012, PT II, 2012, 7332 : 155 - 163