Entity and relation extraction with rule-guided dictionary as domain knowledge

被引:0
作者
WANG Xinzhi [1 ]
LI Jiahao [1 ]
ZHENG Ze [2 ]
CHANG Yudong [1 ]
ZHU Min [3 ]
机构
[1] School of Computer Engineering and Science, Shanghai University, Shanghai , China
[2] Baidu (China) Co, Ltd, Beijing , China
[3] The Sixth Medical Center of PLA General Hospital, Beijing , China
关键词
entity extraction; relation extraction; prior knowledge; domain rule;
D O I
暂无
中图分类号
TP391.1 [文字信息处理]; F204 [科学技术管理];
学科分类号
081203 ; 0835 ; 020201 ;
摘要
Entity and relation extraction is an indispensable part of domain knowledge graph construction, which can serve relevant knowledge needs in a specific domain, such as providing support for product research, sales, risk control, and domain hotspot analysis. The existing entity and relation extraction methods that depend on pretrained models have shown promising performance on open datasets. However, the performance of these methods degrades when they face domain-specific datasets. Entity extraction models treat characters as basic semantic units while ignoring known character dependency in specific domains. Relation extraction is based on the hypothesis that the relations hidden in sentences are unified, thereby neglecting that relations may be diverse in different entity tuples. To address the problems above, this paper first introduced prior knowledge composed of domain dictionaries to enhance characters’ dependence. Second, domain rules were built to eliminate noise in entity relations and promote potential entity relation extraction. Finally, experiments were designed to verify the effectiveness of our proposed methods. Experimental results on two domains, including laser industry and unmanned ship, showed the superiority of our methods. The F1 value on laser industry entity, unmanned ship entity, laser industry relation, and unmanned ship relation datasets is improved by +1%, +6%, +2%, and +1%, respectively. In addition, the extraction accuracy of entity relation triplet reaches 83% and 76% on laser industry entity pair and unmanned ship entity pair datasets, respectively.
引用
收藏
页码:610 / 622
页数:13
相关论文
empty
未找到相关数据