Leveraging knowledge graph for domain-specific Chinese named entity recognition via lexicon-based relational graph transformer

被引:5
作者
Gao, Yunbo [1 ]
Gong, Guanghong [1 ]
Ye, Bipeng [1 ]
Tian, Xingyu [1 ]
Li, Ni [1 ]
Yuan, Haitao [1 ]
机构
[1] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing, Peoples R China
关键词
deep learning; knowledge graph; Chinese named entity recognition; CNER; lexicon augmentation; relational graph transformer; RGT; lexicon-based relational graph transformer; LRGT;
D O I
10.1504/IJBIC.2023.131912
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Leveraging knowledge graphs (KGs) has been an emerging direction to improve the performance of deep learning-based Chinese named entity recognition (CNER). Nevertheless, most existing methods directly inject correlated words into sentences but ignore word boundaries that are crucial for CNER. Conflicts among incorrect word segmentations may misguide models to predict incorrect labels. To solve this problem, this work investigates a novel lexicon-based relational graph transformer (LRGT), which combines relational graph-structured inputs and transformer tailored for lexicon-augmented CNER. In LRGT, characters and self-matched lexicon words are fully interacted through a two-phase relational graph softmax message passing mechanism. The finally enhanced character representation in LRGT dynamically integrates both lexical and relative positional information, which is distinguishable for the identification. Results on four benchmark datasets demonstrate that LRGT significantly outperforms several state-of-the-art methods. We further demonstrate that LRGT with KG achieves higher performance on two public specific-domain CNER datasets. LRGT performs up to 3.35 times faster than several typical baselines while achieving better F1-score by up to 1.92% and 2.24%, respectively.
引用
收藏
页码:148 / 162
页数:16
相关论文
共 37 条
[1]   Investigating Typed Syntactic Dependencies for Targeted Sentiment Classification Using Graph Attention Neural Network [J].
Bai, Xuefeng ;
Liu, Pengbo ;
Zhang, Yue .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 (503-514) :503-514
[2]  
Bordes A., 2013, P 26 INT C NEUR INF, V2, P2787
[3]   Learning graph normalization for graph neural networks [J].
Chen, Yihao ;
Tang, Xin ;
Qi, Xianbiao ;
Li, Chun-Guang ;
Xiao, Rong .
NEUROCOMPUTING, 2022, 493 :613-625
[4]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[5]  
Ding RX, 2019, 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), P1462
[6]  
Dwivedi VP, 2012, AAAI 2021 WORKSHOP D, DOI DOI 10.48550/ARXIV.2012.09699
[7]  
Gui T, 2019, PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P4982
[8]  
Gui T, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P1040
[9]  
Guo QP, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P1315
[10]  
Han X, 2018, CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018): PROCEEDINGS OF SYSTEM DEMONSTRATIONS, P139