Integration of Relation Filtering and Multi-Task Learning in GlobalPointer for Entity and Relation Extraction

被引:0
作者
Liu, Bin [1 ,2 ]
Tao, Jialin [1 ,2 ]
Chen, Wanyuan [1 ,2 ]
Zhang, Yijie [1 ,2 ]
Chen, Min [1 ,2 ]
He, Lei [1 ,2 ]
Tang, Dan [1 ,2 ]
机构
[1] Chengdu Univ Informat Technol, Sch Software Engn, Chengdu 610225, Peoples R China
[2] Sichuan Prov Engn Technol Res Ctr Support Software, Chengdu 610225, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 15期
关键词
knowledge graph; joint entity and relation extraction; overlapping triplets; multi-task learning;
D O I
10.3390/app14156832
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The rise of knowledge graphs has been instrumental in advancing artificial intelligence (AI) research. Extracting entity and relation triples from unstructured text is crucial for the construction of knowledge graphs. However, Chinese text has a complex grammatical structure, which may lead to the problem of overlapping entities. Previous pipeline models have struggled to address such overlap problems effectively, while joint models require entity annotations for each predefined relation in the set, which results in redundant relations. In addition, the traditional models often lead to task imbalance by overlooking the differences between tasks. To tackle these challenges, this research proposes a global pointer network based on relation prediction and loss function improvement (GPRL) for joint extraction of entities and relations. Experimental evaluations on the publicly available Chinese datasets DuIE2.0 and CMeIE demonstrate that the GPRL model achieves a 1.2-26.1% improvement in F1 score compared with baseline models. Further, experiments of overlapping classification conducted on CMeIE have also verified the effectiveness of overlapping triad extraction and ablation experiments. The model is helpful in identifying entities and relations accurately and can reduce redundancy by leveraging relation filtering and the global pointer network. In addition, the incorporation of a multi-task learning framework balances the loss functions of multiple tasks and enhances task interactions.
引用
收藏
页数:17
相关论文
共 46 条
[1]  
[Anonymous], 2012, Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, DOI DOI 10.1162/153244303322533223
[2]  
[Anonymous], 2014, P COLING 2014 25 INT
[3]  
[Anonymous], 2015, P 2015 C EMP METH NA, DOI DOI 10.18653/V1/D15-1206
[4]  
Buckman J., 2018, INT C LEARN REPR
[5]  
Chen Z, 2018, PR MACH LEARN RES, V80
[6]   Knowledge Graph-based Event Embedding Framework for Financial Quantitative Investments [J].
Cheng, Dawei ;
Yang, Fangzhou ;
Wang, Xiaoyang ;
Zhang, Ying ;
Zhang, Liqing .
PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20), 2020, :2221-2230
[7]  
Chowdhary KR1442, 2020, Natural language processing. Fundamentals of artificial intelligence, P603, DOI DOI 10.1007/978-81-322-3972-7_19
[8]   Emerging Trends Word2Vec [J].
Church, Kenneth Ward .
NATURAL LANGUAGE ENGINEERING, 2017, 23 (01) :155-162
[9]  
Crawshaw M., 2020, arXiv
[10]   Pre-Training With Whole Word Masking for Chinese BERT [J].
Cui, Yiming ;
Che, Wanxiang ;
Liu, Ting ;
Qin, Bing ;
Yang, Ziqing .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 :3504-3514