Creating Knowledge Graph of Electric Power Equipment Faults Based on BERT-BiLSTM-CRF Model

Cited by: 111
Authors
Meng, Fanqi [1, 2]
Yang, Shuaisong [1]
Wang, Jingdong [1]
Xia, Lei [1]
Liu, Han [1]
Affiliations
[1] Northeast Elect Power Univ, Sch Comp Sci, Jilin 132012, Jilin, Peoples R China
[2] Guangdong Atv Acad Performing Arts, Sch Informat Engn, Dongguan 523710, Guangdong, Peoples R China
Keywords
Knowledge graph; Electric power equipment; Fault diagnosis; Entity recognition; Relation extraction; Networks
DOI
10.1007/s42835-022-01032-3
Chinese Library Classification (CLC)
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification codes
0808; 0809
Abstract
Creating a large-scale knowledge graph of electric power equipment faults will facilitate the development of automatic fault diagnosis and intelligent question answering (QA) in the electric power industry. However, most existing methods achieve low accuracy in Chinese entity recognition, which makes it hard to build a high-quality knowledge graph by extracting knowledge from Chinese technical literature. To solve this problem, a novel model called BERT-BiLSTM-CRF is proposed. It combines Bidirectional Encoder Representations from Transformers (BERT), Bidirectional Long Short-Term Memory (BiLSTM), and a Conditional Random Field (CRF). The model first identifies and extracts electric power equipment entities from pre-processed Chinese technical literature. Then, the semantic relations between the entities are extracted using a relation classification method based on dependency parsing. Finally, the extracted knowledge is stored in the Neo4j database as triples and visualized as a graph. Through these steps, a Chinese knowledge graph of electric power equipment faults can be built. The novelty of the model lies in how it combines its modules: the BERT module learns both phrase-level representations and rich semantic features, while the CRF module constrains the predicted labels and reduces the rate of irregular label predictions, which improves entity recognition accuracy. Experiments on Chinese technical literature about fault diagnosis of electric power equipment show that the model identifies and extracts Chinese entities more accurately than traditional methods. Thus, a comprehensive and accurate Chinese knowledge graph of electric power equipment faults can be constructed more easily.
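The constraint that a CRF layer places on label predictions can be illustrated with a minimal sketch. It is not the paper's implementation: the tag set (`O`, `B-EQU`, `I-EQU`), the emission scores, and the function names are all hypothetical, and the learned transition scores of a real CRF are replaced here by one hard-coded BIO rule (an `I-` tag may only follow a `B-` or `I-` tag of the same type). The sketch compares per-token argmax decoding, which can emit an invalid sequence, with constrained Viterbi decoding.

```python
from typing import Dict, List

# Hypothetical BIO tag set for equipment entities (not taken from the paper).
TAGS = ["O", "B-EQU", "I-EQU"]
NEG_INF = float("-inf")


def allowed(prev: str, curr: str) -> bool:
    """BIO rule: I-X may only follow B-X or I-X; everything else is allowed."""
    if curr.startswith("I-"):
        return prev in ("B-" + curr[2:], "I-" + curr[2:])
    return True


def viterbi(emissions: List[Dict[str, float]]) -> List[str]:
    """Return the highest-scoring tag path that never violates `allowed`."""
    # A sequence may not start with an I- tag, so those start at -inf.
    best = [{t: NEG_INF if t.startswith("I-") else emissions[0][t] for t in TAGS}]
    back = []  # back[i][tag] = best predecessor of `tag` at position i + 1
    for scores in emissions[1:]:
        row, ptr = {}, {}
        for curr in TAGS:
            candidates = [(best[-1][p], p) for p in TAGS if allowed(p, curr)]
            prev_score, prev_tag = max(candidates)
            row[curr] = prev_score + scores[curr]
            ptr[curr] = prev_tag
        best.append(row)
        back.append(ptr)
    # Trace the best path backwards from the best final tag.
    tag = max(TAGS, key=lambda t: best[-1][t])
    path = [tag]
    for ptr in reversed(back):
        tag = ptr[tag]
        path.append(tag)
    return path[::-1]


# Made-up per-token scores, standing in for the output of an encoder.
emissions = [
    {"O": 1.0, "B-EQU": 0.8, "I-EQU": 0.1},
    {"O": 0.2, "B-EQU": 0.3, "I-EQU": 1.5},
    {"O": 0.1, "B-EQU": 0.2, "I-EQU": 1.2},
]

# Independent per-token argmax ignores the BIO rule.
greedy = [max(TAGS, key=lambda t: s[t]) for s in emissions]
constrained = viterbi(emissions)

print(greedy)       # ['O', 'I-EQU', 'I-EQU']  (invalid: I-EQU follows O)
print(constrained)  # ['B-EQU', 'I-EQU', 'I-EQU']
```

In a trained model the transition scores would be learned jointly with the encoder rather than fixed by hand; the hard rule above only shows why sequence-level decoding can rule out irregular label sequences that token-level prediction allows.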
Pages: 2507-2516
Number of pages: 10