A large-scale dataset for Korean document-level relation extraction from encyclopedia texts

Cited by: 0
Authors
Son, Suhyune [1]
Lim, Jungwoo [1]
Koo, Seonmin [1]
Kim, Jinsung [1]
Kim, Younghoon [2]
Lim, Youngsik [2]
Hyun, Dongseok [2]
Lim, Heuiseok [1]
Affiliations
[1] Korea Univ, Comp Sci & Engn, 1 5-ka, Anam Dong, Seoul 02841, South Korea
[2] NAVER, 5 Jeongjail ro, Buljeong ro, Seongnam 13561, South Korea
Funding
National Research Foundation, Singapore;
Keywords
Natural Language Processing; Information Extraction; Document-level Relation Extraction; Korean Relation Extraction; ENTITY;
DOI
10.1007/s10489-024-05605-9
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Document-level relation extraction (RE) aims to predict the relational facts between two given entities from a document. Unlike the widespread research on document-level RE in English, Korean document-level RE research is still at a very early stage due to the absence of a dataset. To accelerate such studies, we present the TREK (Toward Document-Level Relation Extraction in Korean) dataset, constructed from Korean encyclopedia documents written by domain experts. We provide detailed statistical analyses of our large-scale dataset, and human evaluation results suggest that TREK is of high quality. We also introduce a document-level RE model that considers named-entity types while accounting for the properties of the Korean language. In experiments, we demonstrate that the proposed model outperforms the baselines, and we conduct a qualitative analysis.
Pages: 8681-8701
Number of pages: 21
Related papers
50 in total
  • [21] Document-level relation extraction with structural encoding and entity-pair-level information interaction
    Liu, Wanlong
    Xiao, Yichen
    Cheng, Shaohuan
    Zeng, Dingyi
    Zhou, Li
    Kong, Weishan
    Zhang, Malu
    Chen, Wenyu
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 268
  • [22] Document-level Relation Extraction via Separate Relation Representation and Logical Reasoning
    Huang, Heyan
    Yuan, Changsen
    Liu, Qian
    Cao, Yixin
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2024, 42 (01)
  • [23] Enhancing Document-Level Relation Extraction with Entity Pronoun Resolution and Relation Correlation
    Pi, Qiankun
    Lu, Jicang
    Sun, Yepeng
    Zhu, Taojie
    Xia, Yi
    Yang, Chenguang
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, PT II, NLPCC 2024, 2025, 15360 : 174 - 186
  • [24] Document-Level Iterative Entity and Relation Extraction for Materials Scientific Literature
    Geng, Qiqi
    You, Jinguo
    Guo, Huayi
    Huang, Xingrui
    Tao, Jingmei
    Yi, Jianhong
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT III, ICIC 2024, 2024, 14877 : 499 - 510
  • [25] Heterogenous affinity graph inference network for document-level relation extraction
    Li, Rongzhen
    Zhong, Jiang
    Xue, Zhongxuan
    Dai, Qizhu
    Li, Xue
    KNOWLEDGE-BASED SYSTEMS, 2022, 250
  • [26] Multi-granularity Neural Networks for Document-Level Relation Extraction
    Chen, Xiye
    Wang, Peng
    WEB AND BIG DATA, APWEB-WAIM 2024, PT V, 2024, 14965 : 95 - 112
  • [27] CRFLOE: Context Region Filter and Relation Word Aware for Document-Level Relation Extraction
    Yang, DanPing
    Li, XianXian
    Wu, Hao
    Zhou, Aoxiang
    Liu, Peng
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT III, ICIC 2024, 2024, 14877 : 102 - 114
  • [28] Document-Level Relation Extraction with Additional Evidence and Entity Type Information
    Li, Jinliang
    Wang, Junlei
    Li, Canyu
    Liu, Xiaojing
    Feng, Zaiwen
    Qin, Li
    Mayer, Wolfgang
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT III, ICIC 2024, 2024, 14877 : 226 - 237
  • [29] Dual-stream dynamic graph structure network for document-level relation extraction
    Zhong, Yu
    Shen, Bo
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2024, 36 (09)
  • [30] Document-Level Relation Extraction Method Based on Attention Semantic Enhancement
    Liu X.
    Wu W.
    Zhao W.
    Hou W.
    Tongji Daxue Xuebao/Journal of Tongji University, 2024, 52 (05): : 822 - 828