Joint multi-view character embedding model for named entity recognition of Chinese car reviews

被引:4
作者
Ding, Jiaming [1 ,2 ]
Xu, Wenping [3 ]
Wang, Anning [1 ,2 ]
Zhao, Shuangyao [1 ,2 ]
Zhang, Qiang [1 ,2 ]
机构
[1] Hefei Univ Technol, Sch Management, Hefei 230009, Peoples R China
[2] Minist Educ, Key Lab Proc Optimizat & Intelligent Decismaking, Hefei 230009, Peoples R China
[3] Weichai Power Co Ltd, Weifang 261061, Peoples R China
基金
中国国家自然科学基金;
关键词
Named entity recognition; Multi-view character embedding; Domain-specific knowledge; Deep learning; Natural language processing;
D O I
10.1007/s00521-023-08476-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Named entity recognition (NER) has always been an important research task in information extraction and knowledge graph construction. Due to the randomness of Chinese user-generated reviews, character substitution and informal expression are very common. Its widespread phenomenon leads to that Chinese car reviews NER is still a major challenge. In this paper, we propose a joint multi-view character embedding model for Chinese NER (JMCE-CNER) of car reviews. Firstly, deeper character features are extracted from pronunciation, radical, and glyph views to generate the multi-view character embedding. Secondly, a car domain dictionary is constructed for providing accurate word-level information. Thirdly, the multi-view character embedding and the word-level embedding are jointly fed into the deep learning model to perform the Chinese car reviews NER. The experimental datasets of Chinese car reviews are obtained by manual annotation, containing four types of entities, namely brand, model, attribute and structure of the car. The experimental results on the Chinese car review datasets demonstrate that our proposed model achieves the optimal performance compared with the other state-of-the-art models. Furthermore, the model substantially reduces the impact of character substitution and informal expression on performing NER tasks.
引用
收藏
页码:14947 / 14962
页数:16
相关论文
共 60 条
[31]  
Liu Z., 2020, DECIS SUPPORT SYST, V143
[32]  
Liu ZX, 2010, LECT NOTES ARTIF INT, V6216, P634
[33]   An attention-based BiLSTM-CRF approach to document-level chemical named entity recognition [J].
Luo, Ling ;
Yang, Zhihao ;
Yang, Pei ;
Zhang, Yin ;
Wang, Lei ;
Lin, Hongfei ;
Wang, Jian .
BIOINFORMATICS, 2018, 34 (08) :1381-1388
[34]  
Ma XZ, 2016, PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, P1064
[35]   Identifying gene and protein mentions in text using conditional random fields [J].
McDonald, R ;
Pereira, F .
BMC BIOINFORMATICS, 2005, 6 (Suppl 1)
[36]  
Mikolov Tomas, 2013, Advances in Neural Information Processing Systems, P3111
[37]  
Morwal S., 2012, International Journal on Natural Language Computing (IJNLC), V1, P15, DOI DOI 10.5121/IJNLC.2012.1402
[38]   Incorporating token-level dictionary feature into neural model for named entity recognition [J].
Mu Xiaofeng ;
Wang Wei ;
Xu Aiping .
NEUROCOMPUTING, 2020, 375 :43-50
[39]   Borrowing wisdom from world: modeling rich external knowledge for Chinese named entity recognition [J].
Nie, Yu ;
Zhang, Yilai ;
Peng, Yongkang ;
Yang, Lisha .
NEURAL COMPUTING & APPLICATIONS, 2022, 34 (06) :4905-4922
[40]  
Pande S. D., 2022, Machine Learning Applications in Engineering Education and Management, V2, P30