Joint multi-view character embedding model for named entity recognition of Chinese car reviews

被引:4
作者
Ding, Jiaming [1 ,2 ]
Xu, Wenping [3 ]
Wang, Anning [1 ,2 ]
Zhao, Shuangyao [1 ,2 ]
Zhang, Qiang [1 ,2 ]
机构
[1] Hefei Univ Technol, Sch Management, Hefei 230009, Peoples R China
[2] Minist Educ, Key Lab Proc Optimizat & Intelligent Decismaking, Hefei 230009, Peoples R China
[3] Weichai Power Co Ltd, Weifang 261061, Peoples R China
基金
中国国家自然科学基金;
关键词
Named entity recognition; Multi-view character embedding; Domain-specific knowledge; Deep learning; Natural language processing;
D O I
10.1007/s00521-023-08476-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Named entity recognition (NER) has always been an important research task in information extraction and knowledge graph construction. Due to the randomness of Chinese user-generated reviews, character substitution and informal expression are very common. Its widespread phenomenon leads to that Chinese car reviews NER is still a major challenge. In this paper, we propose a joint multi-view character embedding model for Chinese NER (JMCE-CNER) of car reviews. Firstly, deeper character features are extracted from pronunciation, radical, and glyph views to generate the multi-view character embedding. Secondly, a car domain dictionary is constructed for providing accurate word-level information. Thirdly, the multi-view character embedding and the word-level embedding are jointly fed into the deep learning model to perform the Chinese car reviews NER. The experimental datasets of Chinese car reviews are obtained by manual annotation, containing four types of entities, namely brand, model, attribute and structure of the car. The experimental results on the Chinese car review datasets demonstrate that our proposed model achieves the optimal performance compared with the other state-of-the-art models. Furthermore, the model substantially reduces the impact of character substitution and informal expression on performing NER tasks.
引用
收藏
页码:14947 / 14962
页数:16
相关论文
共 60 条
[1]  
Akbik A, 2018, P 27 INT C COMP LING, P1638
[2]   CWI: A multimodal deep learning approach for named entity recognition from social media using character, word and image features [J].
Asgari-Chenaghlu, Meysam ;
Feizi-Derakhshi, M. Reza ;
Farzinvash, Leili ;
Balafar, M. A. ;
Motamed, Cina .
NEURAL COMPUTING & APPLICATIONS, 2022, 34 (03) :1905-1922
[3]   Analysis of named entity recognition and linking for tweets [J].
Derczynski, Leon ;
Maynard, Diana ;
Rizzo, Giuseppe ;
van Erp, Marieke ;
Gorrell, Genevieve ;
Troncy, Raphael ;
Petrak, Johann ;
Bontcheva, Kalina .
INFORMATION PROCESSING & MANAGEMENT, 2015, 51 (02) :32-49
[4]  
Devlin J, 2019, Arxiv, DOI [arXiv:1810.04805, 10.48550/arxiv.1810.04805]
[5]  
Ding ZM, 2018, PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P5434
[6]   Character-Based LSTM-CRF with Radical-Level Features for Chinese Named Entity Recognition [J].
Dong, Chuanhai ;
Zhang, Jiajun ;
Zong, Chengqing ;
Hattori, Masanori ;
Di, Hui .
NATURAL LANGUAGE UNDERSTANDING AND INTELLIGENT APPLICATIONS (NLPCC 2016), 2016, 10102 :239-250
[7]  
Peters ME, 2018, Arxiv, DOI [arXiv:1802.05365, DOI 10.18653/V1/N18-1202, DOI 10.48550/ARXIV.1802.05365]
[8]   Referent graph embedding model for name entity recognition of Chinese car reviews [J].
Fang, Zhao ;
Zhang, Qiang ;
Kok, Stanley ;
Li, Ling ;
Wang, Anning ;
Yang, Shanlin .
KNOWLEDGE-BASED SYSTEMS, 2021, 233
[9]   VITERBI ALGORITHM [J].
FORNEY, GD .
PROCEEDINGS OF THE IEEE, 1973, 61 (03) :268-278
[10]  
Gaio M., 2017, P 9 INT C ADV GEOGR, P15