Named Entity Recognition Method for Process Planning Text

被引:0
作者
Dong H. [1 ]
Li Y. [1 ]
Qiao L. [1 ]
Huang Z. [1 ]
机构
[1] School of Mechanical Engineering & Automation, Beihang University, Beijing
来源
Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics | 2024年 / 36卷 / 02期
关键词
bidirectional long short term memory network; conditional random field; knowledge extraction; named entity recognition;
D O I
10.3724/SP.J.1089.2024.19732
中图分类号
学科分类号
摘要
To realize the efficient recognition of critical information in unstructured process planning text, a named entity recognition model based on technology dictionary and neural network is established. Firstly, the technology dictionary and jieba word segmentation technology are comprehensively combined to realize automatic annotation of datasets, especially, the number and its identification letters are recognized as one unit in the automatic annotation of process parameter data, which enhances the effect of subsequent feature extraction. Secondly, the bidirectional long short term memory network is used to extract the feature of text information based on word2vec. Finally, conditional random field model is used to synthesize contextual logic to improve the recognition accuracy of critical process information. To verify the effectiveness of the proposed model, 431 work steps are utilized as training sample. Experimental results show that the values of accuracy rate, recall and F1 are 90.20%, 93.88% and 92.00% respectively, which has certain advantages compared with traditional models in the field. In addition, three experimental datasets from different technology books are tested, the results also show high robustness of the proposed model. © 2024 Institute of Computing Technology. All rights reserved.
引用
收藏
页码:313 / 320
页数:7
相关论文
共 12 条
[1]  
Zhang Faping, Li Li, Research on knowledge push method for business process based on multidimensional hierarchical context model, Journal of Computer-Aided Design & Computer Graphics, 29, 4, pp. 751-758, (2017)
[2]  
Tang Jiayu, Liu Zhiyuan, Sun Maosong, A survey of text visualization, Journal of Computer-Aided Design & Computer Graphics, 25, 3, pp. 273-285, (2013)
[3]  
Zhang Y J, Xu Z T, Zhang T., Fusion of multiple features for Chinese named entity recognition based on CRF model, Proceedings of the 4th Asia Information Retrieval Symposium, pp. 95-106, (2008)
[4]  
Sun S C, Yun J, Lin H F, Et al., Granular transfer learning using type-2 fuzzy HMM for text sequence recognition, Neurocomputing, 214, pp. 126-133, (2016)
[5]  
Collobert R, Weston J, Bottou L, Et al., Natural language processing (almost) from scratch, The Journal of Machine Learning Research, 12, 1, pp. 2493-2537, (2011)
[6]  
Khalifa M, Shaalan K., Character convolutions for Arabic named entity recognition with long short-term memory networks, Computer Speech & Language, 58, pp. 335-346, (2019)
[7]  
Yu S, Cheng Y, Xie L, Et al., A novel recurrent hybrid network for feature fusion in action recognition, Journal of Visual Communication and Image Representation, 49, 11, pp. 192-203, (2017)
[8]  
Liu Yupeng, Li Dongdong, Chinese named entity recognition method based on Bi-directional LSTM-CNN-CRF, Journal of Harbin University of Science and Technology, 25, 1, pp. 115-120, (2020)
[9]  
Liu J X, Wu F Z, Wu C H, Et al., Neural Chinese word segmentation with dictionary knowledge, Proceedings of CCF International Conference on Natural Language Processing and Chinese Computing, pp. 80-91, (2018)
[10]  
Chen Yiren, He Kunjin, Lu Fengwei, Et al., Discrimination of fracture types based on neural network and voxelized bone template, Journal of Computer-Aided Design & Computer Graphics, 33, 8, pp. 1295-1307, (2021)