A large-scale dataset for korean document-level relation extraction from encyclopedia texts

被引:0
|
作者
Son, Suhyune [1 ]
Lim, Jungwoo [1 ]
Koo, Seonmin [1 ]
Kim, Jinsung [1 ]
Kim, Younghoon [2 ]
Lim, Youngsik [2 ]
Hyun, Dongseok [2 ]
Lim, Heuiseok [1 ]
机构
[1] Korea Univ, Comp Sci & Engn, 1 5-ka,Anam Dong, Seoul 02841, South Korea
[2] NAVER, 5 Jeongjail ro,Buljeong ro, Seongnam 13561, South Korea
基金
新加坡国家研究基金会;
关键词
Natural Language Processing; Information Extraction; Document-level Relation Extraction; Korean Relation Extraction; ENTITY;
D O I
10.1007/s10489-024-05605-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Document-level relation extraction (RE) aims to predict the relational facts between two given entities from a document. Unlike widespread research on document-level RE in English, Korean document-level RE research is still at the very beginning due to the absence of a dataset. To accelerate the studies, we present TREK (Toward Document-Level Relation Extraction in Korean) dataset constructed from Korean encyclopedia documents written by the domain experts. We provide detailed statistical analyses for our large-scale dataset and human evaluation results suggest the assured quality of TREK . Also, we introduce the document-level RE model that considers the named entity-type while considering the Korean language's properties. In the experiments, we demonstrate that our proposed model outperforms the baselines and conduct qualitative analysis.
引用
收藏
页码:8681 / 8701
页数:21
相关论文
共 50 条
  • [31] Document-level relation extraction with multi-semantic knowledge interaction
    Hou, Wenlong
    Wu, Wenda
    Liu, Xianhui
    Zhao, Weidong
    INFORMATION SCIENCES, 2024, 679
  • [32] Multi-relation Identification for Few-Shot Document-Level Relation Extraction
    Wang, Dazhuang
    Wu, Shaojuan
    Zhang, Xiaowang
    Feng, Zhiyong
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT IX, 2023, 14262 : 52 - 64
  • [33] Biomedical document-level relation extraction with thematic capture and localized entity pooling
    Li, Yuqing
    Shao, Xinhui
    JOURNAL OF BIOMEDICAL INFORMATICS, 2024, 160
  • [34] Mining heuristic evidence sentences for more interpretable document-level relation extraction
    Zhu, Taojie
    Lu, Jicang
    Zhou, Gang
    Ding, Xiaoyao
    Guo, Panpan
    Wu, Hao
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2023, 35 (07)
  • [35] Deconstructing reasoning paths and attending to semantic guidance for document-level relation extraction
    Zhong, Yu
    Shen, Bo
    Wang, Tao
    KNOWLEDGE-BASED SYSTEMS, 2024, 301
  • [36] Document-Level Relation Extraction with a Dependency Syntax Transformer and Supervised Contrastive Learning
    Yang, Ming
    Zhang, Yijia
    Banbhrani, Santosh Kumar
    Lin, Hongfei
    Lu, Mingyu
    KNOWLEDGE GRAPH AND SEMANTIC COMPUTING: KNOWLEDGE GRAPH EMPOWERS THE DIGITAL ECONOMY, CCKS 2022, 2022, 1669 : 43 - 54
  • [37] Self-supervised commonsense knowledge learning for document-level relation extraction
    Li, Rongzhen
    Zhong, Jiang
    Xue, Zhongxuan
    Dai, Qizhu
    Li, Xue
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 250
  • [38] Enhancing Document-Level Relation Extraction with Attention-Convolutional Hybrid Networks and Evidence Extraction
    Zhang, Feiyu
    Hu, Ruiming
    Duan, Guiduo
    Huang, Tianxi
    COGNITIVE COMPUTATION, 2024, : 1113 - 1124
  • [39] Document-Level Relation Extraction Based on Fine-Grained Information Guidance
    Pu, Chujun
    Zhang, Xuejie
    Wang, Jin
    Zhou, Xiaobing
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT III, ICIC 2024, 2024, 14877 : 378 - 390
  • [40] NA-Aware Machine Reading Comprehension for Document-Level Relation Extraction
    Zhang, Zhenyu
    Yu, Bowen
    Shu, Xiaobo
    Liu, Tingwen
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2021: RESEARCH TRACK, PT III, 2021, 12977 : 580 - 595