Harnessing Causal Structure Alignment for Enhanced Cross-Domain Named Entity Recognition

被引:0
作者
Liu, Xiaoming [1 ,2 ]
Cao, Mengyuan [1 ,3 ]
Yang, Guan [1 ,4 ]
Liu, Jie [2 ,5 ]
Liu, Yang [6 ]
Wang, Hang [1 ,3 ]
机构
[1] Zhongyuan Univ Technol, Sch Comp Sci, Zhengzhou 450007, Peoples R China
[2] China Language Intelligence Res Ctr, Beijing 100089, Peoples R China
[3] Henan Key Lab Publ Opin Intelligent Anal, Zhengzhou 450007, Peoples R China
[4] Zhengzhou Key Lab Text Proc & Image Understanding, Zhengzhou 450007, Peoples R China
[5] North China Univ Technol, Sch Informat Sci, Beijing 100144, Peoples R China
[6] Xidian Univ, Sch Telecommun Engn, Xian 710071, Peoples R China
关键词
cross-domain named entity recognition; transfer learning; causal inference; feature interactions; causally invariant knowledge;
D O I
10.3390/electronics13010067
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cross-domain named entity recognition (NER) is a crucial task in various practical applications, particularly when faced with the challenge of limited data availability in target domains. Existing methodologies primarily depend on feature representation or model parameter sharing mechanisms to enable the transfer of entity recognition capabilities across domains. However, these approaches often ignore the latent causal relationships inherent in invariant features. To address this limitation, we propose a novel framework, the Causal Structure Alignment-based Cross-Domain Named Entity Recognition (CSA-NER) framework, designed to harness the causally invariant features within causal structures to enhance the cross-domain transfer of entity recognition competence. Initially, CSA-NER constructs a causal feature graph utilizing causal discovery to ascertain causal relationships between entities and contextual features across source and target domains. Subsequently, it performs graph structure alignment to extract causal invariant knowledge across domains via the graph optimal transport (GOT) method. Finally, the acquired causal invariant knowledge is refined and utilized through the integration of Gated Attention Units (GAUs). Comprehensive experiments conducted on five English datasets and a specific CD-NER dataset exhibit a notable improvement in the average performance of the CSA-NER model in comparison to existing cross-domain methods. These findings underscore the significance of unearthing and employing latent causal invariant knowledge to effectively augment the entity recognition capabilities in target domains, thereby contributing a robust methodology to the broader realm of cross-domain natural language processing.
引用
收藏
页数:18
相关论文
共 33 条
[1]   A Review on Electronic Health Record Text-Mining for Biomedical Name Entity Recognition in Healthcare Domain [J].
Ahmad, Pir Noman ;
Shah, Adnan Muhammad ;
Lee, KangYoon .
HEALTHCARE, 2023, 11 (09)
[2]  
Arjovsky M, 2020, Arxiv, DOI [arXiv:1907.02893, 10.48550/arXiv.1907.02893]
[3]  
Chen Liqun, 2019, 25 AMERICAS C INFORM
[4]  
Chen X., 2023, P 32 INT JOINT C ART, VVolume 2301, P10410
[5]  
Chen YD, 2022, ADV NEUR IN
[6]  
Chevalley M, 2022, Arxiv, DOI arXiv:2206.11646
[7]  
Cuiy LY, 2021, FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, P1835
[8]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[9]   Named Entity Recognition and Classification in Historical Documents: A Survey [J].
Ehrmann, Maud ;
Hamdi, Ahmed ;
Pontes, Elvys Linhares ;
Romanello, Matteo ;
Doucet, Antoine .
ACM COMPUTING SURVEYS, 2024, 56 (02)
[10]   Few-shot classification in Named Entity Recognition Task [J].
Fritzler, Alexander ;
Logacheva, Varvara ;
Kretov, Maksim .
SAC '19: PROCEEDINGS OF THE 34TH ACM/SIGAPP SYMPOSIUM ON APPLIED COMPUTING, 2019, :993-1000