A Chinese Named Entity Recognition Method for News Domain Based on Transfer Learning and Word Embeddings

被引:0
作者
Fang, Rui [1 ]
Cui, Liangzhong [1 ]
机构
[1] Naval Univ Engn, Wuhan 430033, Peoples R China
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2025年 / 83卷 / 02期
关键词
News domain; named entity recognition (NER); transfer learning; word embeddings; ERNIE; soft-lexicon;
D O I
10.32604/cmc.2025.060422
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Named Entity Recognition (NER) is vital in natural language processing for the analysis of news texts, as it accurately identifies entities such as locations, persons, and organizations, which is crucial for applications like news summarization and event tracking. However, NER in the news domain faces challenges due to insufficient annotated data, complex entity structures, and strong context dependencies. To address these issues, we propose a new Chinese-named entity recognition method that integrates transfer learning with word embeddings. Our approach leverages the ERNIE pre-trained model for transfer learning and obtaining general language representations and incorporates the Soft-lexicon word embedding technique to handle varied entity structures. This dual-strategy enhances the model's understanding of context and boosts its ability to process complex texts. Experimental results show that our method achieves an F1 score of 94.72% on a news dataset, surpassing baseline methods by 3%-4%, thereby confirming its effectiveness for Chinese-named entity recognition in the news domain.
引用
收藏
页码:3247 / 3275
页数:29
相关论文
共 37 条
[1]  
[Anonymous], 2020, Journal of Computer Applications, V40, P1879
[2]  
Cao PF, 2018, 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), P182
[3]  
Chen X., 2015, P 2015 C EMP METH NA, P1197, DOI [DOI 10.18653/V1/D15-1141, 10.18653/V1/D15-1141]
[4]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[5]  
Gui T, 2019, PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P4982
[6]   Lexicon enhanced Chinese named entity recognition with pointer network [J].
Guo, Qian ;
Guo, Yi .
NEURAL COMPUTING & APPLICATIONS, 2022, 34 (17) :14535-14555
[7]   A chinese named entity recognition method for small-scale dataset based on lexicon and unlabeled data [J].
Huang, Shaobin ;
Sha, Yongpeng ;
Li, Rongsheng .
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (02) :2185-2206
[8]  
[金志刚 Jin Zhigang], 2023, [哈尔滨工业大学学报, Journal of Harbin Institute of Technology], V55, P50
[9]  
Lample G., 2016, NEURAL ARCHITECTURES, DOI [10.18653/v1/N16-1030, DOI 10.18653/V1/N16-1030]
[10]  
Li K, 2022, Inf Stud Theory Appl, V45, P184, DOI [10.16353/j.cnki.1000-7490.2022.04.025, DOI 10.16353/J.CNKI.1000-7490.2022.04.025]