An adaptable, high-performance relation extraction system for complex sentences

被引:2
|
作者
Thomas, Anu [1 ,2 ]
Sivanesan, Sangeetha [1 ]
机构
[1] Natl Inst Technol, Dept Comp Applicat, Text Analyt & NLP Lab, Tiruchirappalli, India
[2] St Georges Coll, Dept Comp Applicat, Aruvithura, Kerala, India
关键词
Information extraction; Natural language processing; Domain -specific relation extraction; Domain ontology; Semi -supervised learning; Knowledge based system; Judicial text; Open information extraction; Domain adaptability; INFORMATION EXTRACTION;
D O I
10.1016/j.knosys.2022.108956
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The rapid proliferation of text data has lead to an increase in the use of Information Extraction (IE) techniques to automatically extract key information in a fast and effective manner. Relation Extraction (RE), a sub-task of IE focuses on extracting semantic relations from free natural language text and is crucial for further applications including Question Answering, Information Retrieval, Knowledge Base construction, Text Summarization, etc. Literature shows that supervised learning approaches were widely used in RE. However, the performance of supervised methodologies depend on the availability of domain-specific annotated datasets which is not viable for many of the domains including legal, financial, insurance etc. In recent times, Open Information Extraction (OIE) techniques address this issue, by facilitating domain-independent extraction of relations from large text corpora with no demand for domain-specific tagged data and predefined relation classes. Even though OIE systems are fast and simple to implement, they are less effective in handling complex sentences, and often produce redundant extractions. This paper proposes an efficient RE system to extract domain-specific relations from natural language text, consisting of Knowledge-based and Semi-supervised learning systems, integrated with domain ontology. We evaluated the performance of proposed work on 'judicial domain " as a use case and found that it overcomes the flaws and limitations of existing RE approaches, by achieving better results in terms of precision and recall. On further analysis, we found that the proposed system outperforms existing cutting-edge OIE systems on varying sentence length and complexity. (C) 2022 Published by Elsevier B.V.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] High-performance unsupervised relation extraction from large corpora
    Rozenfeld, Binjamin
    Feldman, Ronen
    ICDM 2006: SIXTH INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2006, : 1032 - +
  • [2] On the form of parsed sentences for relation extraction
    Chen, Xiaoying
    Zhang, Mi
    Xiong, Shengwu
    Qian, Tieyun
    KNOWLEDGE-BASED SYSTEMS, 2022, 251
  • [3] ADIOS 2: The Adaptable Input Output System. A framework for high-performance data management
    Godoy, William F.
    Podhorszki, Norbert
    Wang, Ruonan
    Atkins, Chuck
    Eisenhauer, Greg
    Gu, Junmin
    Davis, Philip
    Choi, Jong
    Germaschewski, Kai
    Huck, Kevin
    Huebl, Axel
    Kim, Mark
    Kress, James
    Kurc, Tahsin
    Liu, Qing
    Logan, Jeremy
    Mehta, Kshitij
    Ostrouchov, George
    Parashar, Manish
    Poeschel, Franz
    Pugmire, David
    Suchyta, Eric
    Takahashi, Keichi
    Thompson, Nick
    Tsutsumi, Seiji
    Wan, Lipeng
    Wolf, Matthew
    Wu, Kesheng
    Klasky, Scott
    SOFTWAREX, 2020, 12
  • [4] Adaptable, high recall, event extraction system with minimal configuration
    Miwa, Makoto
    Ananiadou, Sophia
    BMC BIOINFORMATICS, 2015, 16
  • [5] Adaptable, high recall, event extraction system with minimal configuration
    Makoto Miwa
    Sophia Ananiadou
    BMC Bioinformatics, 16
  • [6] Relation Extraction with Tree Kernel for Indonesian Sentences
    Suwarningsih, Wiwin
    Supriana, Iping
    2014 INTERNATIONAL CONFERENCE ON ICT FOR SMART SOCIETY (ICISS), 2014, : 22 - 26
  • [7] Biobased epoxy covalent adaptable networks for high-performance recoverable adhesives
    Zhao, Xiao-Li
    Li, Yi-Dong
    Weng, Yunxuan
    Zeng, Jian-Bing
    INDUSTRIAL CROPS AND PRODUCTS, 2023, 192
  • [8] High-Performance Method for Brain Tumor Feature Extraction in MRI Using Complex Network
    Trong, Thanh Han
    Van, Hinh Nguyen
    Dang, Luu Vu
    APPLIED BIONICS AND BIOMECHANICS, 2023, 2023
  • [9] Are Noisy Sentences Useless for Distant Supervised Relation Extraction?
    Shang, Yu-Ming
    Huang, He-Yan
    Mao, Xian-Ling
    Sun, Xin
    Wei, Wei
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 8799 - 8806
  • [10] A high-performance protocol for extraction of microplastics in fish
    Karami, Ali
    Golieskardi, Abolfazl
    Choo, Cheng Keong
    Romano, Nicholas
    Bin Ho, Yu
    Salamatinia, Babak
    SCIENCE OF THE TOTAL ENVIRONMENT, 2017, 578 : 485 - 494