Research on coreference resolution technology of entity in information security

被引:0
作者
Zhang H. [1 ,2 ]
Hu Y. [1 ]
Guo Y. [1 ]
Chen J. [3 ]
机构
[1] Department of Cryptogram Engineering, Information Engineering University, Zhengzhou
[2] Software College, Zhengzhou University, Zhengzhou
[3] Institute of information technology, Information Engineering University, Zhengzhou
来源
Tongxin Xuebao/Journal on Communications | 2020年 / 41卷 / 02期
基金
中国国家自然科学基金;
关键词
BiLSTM-attention-CRF; Coreference resolution; Domain-dictionary matching mechanism; Hybrid method; Information security;
D O I
10.11959/j.issn.1000-436x.2020033
中图分类号
学科分类号
摘要
To solve the problem of coreference resolution in information security, a hybrid method was proposed. Based on the BiLSTM-attention-CRF model, the domain-dictionary matching mechanism was introduced and combined with the attention mechanism at the document level. As a new dictionary-based attention mechanism, the word features were calculated to solve the problem of weak recognition ability of rare entities and entities with long length when extracting candidates from text. And by summarizing the features of the domain texts, the candidates were coreferenced by rules and machine learning according to the part of speech to improve the accuracy. Through the experiments on security data set, the superiority of the method is proved from the aspects of coreference resolution and extraction of candidates from text. © 2020, Editorial Board of Journal on Communications. All right reserved.
引用
收藏
页码:165 / 175
页数:10
相关论文
共 37 条
[1]  
Sukthanker R., Poria S., Cambria E., Et al., Anaphora and coreference resolution: a review, (2018)
[2]  
Vaswani A., Bengio S., Brevdo E., Et al., Tensor2tensor for neural machine translation, (2018)
[3]  
Lample G., Ott M., Conneau A., Et al., Phrase-based & neural unsupervised machine translation, (2018)
[4]  
Chen M.X., Firat O., Bapna A., Et al., The best of both worlds: Combining recent advances in neural machine translation, (2018)
[5]  
Cambria E., Poria S., Hazarika D., Et al., SenticNet 5: discovering conceptual primitives for sentiment analysis by means of context embeddings, Thirty-Second AAAI Conference on Artificial Intelligence, pp. 1795-1802, (2018)
[6]  
Etter M., Colleoni E., Illia L., Et al., Measuring organizational legitimacy in social media: assessing citizens' judgments with sentiment analysis, Business & Society, 57, 1, pp. 60-97, (2018)
[7]  
Ma Y., Peng H., Cambria E., Targeted aspect-based sentiment analysis via embedding commonsense knowledge into an attentive LSTM, Thirty-Second AAAI Conference on Artificial Intelligence, pp. 5876-5883, (2018)
[8]  
Zeng D., Dai Y., Li F., Et al., Adversarial learning for distant supervised relation extraction, Computers, Materials & Continua, 55, 1, pp. 121-136, (2018)
[9]  
Gabor K., Buscaldi D., Schumann A.K., Et al., Semeval-2018 task 7: Semantic relation extraction and classification in scientific papers, The 12th International Workshop on Semantic Evaluation, pp. 679-688, (2018)
[10]  
Qin P., Xu W., Wang W.Y., DSGAN: generative adversarial training for distant supervision relation extraction, (2018)