Community answer generation based on knowledge graph

被引:22
作者
Wu, Yongliang [1 ]
Zhao, Shuliang [2 ,3 ,4 ]
机构
[1] Hebei Normal Univ, Sch Math Sci, Shijiazhuang 050024, Hebei, Peoples R China
[2] Hebei Normal Univ, Coll Comp & Cyber Secur, Shijiazhuang 050024, Hebei, Peoples R China
[3] Hebei Prov Key Lab Network & Informat Secur, Shijiazhuang 050024, Hebei, Peoples R China
[4] Hebei Prov Engn Res Ctr Supply Chain Big Data Ana, Shijiazhuang 050024, Hebei, Peoples R China
关键词
Community question answering; Answer generation; Phrase embedding; Knowledge graph; Phrase mining; QUESTIONS; SYSTEMS;
D O I
10.1016/j.ins.2020.07.077
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Community Question Answering (CQA) has become an indispensable way for modern people to share and acquire knowledge. It allows users to ask questions, which will be answered by experienced users enthusiastically. By recording user operation logs, CQA has accumulated a large amount of valuable and complex data. However, askers must wait (usually for a long time) until other expert users answer their questions on social platforms. This will seriously affect the user experience. In this paper, we propose a Community Answer Generation method based on the Knowledge Graph, called CAGKG, to generate natural language answers automatically. Firstly, we extract the core phrases of posts to represent their semantics relations. Then, we model the user's knowledge background based on their action records. Finally, we query knowledge entities in a knowledge graph based on user background and question semantics, then convert them into natural language answers. Besides, we proposed a Phrase-based Answers Semantic Similarity Evaluation indicator, called PASSE, which focuses on the semantic similarity between texts instead of literal matching. To the best of our knowledge, it is the first work that utilizes the user knowledge and text semantics to improve the performance of CQA. Experiments on four real datasets (Stack Overflow, Super User, Mathematics, and Quora) show that CAGKG is superior to the state-of-the-art question answering frameworks. Compared with other answer evaluation indicators, PASSE is a promising indicator for evaluating semantic similarity. (C) 2020 Elsevier Inc. All rights reserved.
引用
收藏
页码:132 / 152
页数:21
相关论文
共 50 条
[1]  
Araki Jun, 2016, P COLING 2016 26 INT, P1125
[2]   Text Generation From Tables [J].
Bao, Junwei ;
Tang, Duyu ;
Duan, Nan ;
Yan, Zhao ;
Zhou, Ming ;
Zhao, Tiejun .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (02) :311-320
[3]  
Bing Li, 2016, Database Systems for Advanced Applications. 21st International Conference, DASFAA 2016. Proceedings: LNCS 9642, P197, DOI 10.1007/978-3-319-32025-0_13
[4]  
Biswas P, 2005, I CONF VLSI DESIGN, P651
[5]   A Multi-Objective Optimization Approach for Question Routing in Community Question Answering Services [J].
Cheng, Xiang ;
Zhu, Shuguang ;
Su, Sen ;
Chen, Gang .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2017, 29 (09) :1779-1792
[6]  
Datla V.V., 2017, P 26 TEXT RETRIEVAL
[7]   Core techniques of question answering systems over knowledge bases: a survey [J].
Diefenbach, Dennis ;
Lopez, Vanessa ;
Singh, Kamal ;
Maret, Pierre .
KNOWLEDGE AND INFORMATION SYSTEMS, 2018, 55 (03) :529-569
[8]  
Doddington G., 2002, P 2 INT C HUM LANG T, P138, DOI DOI 10.3115/1289189.1289273
[9]  
Eriguchi A, 2019, COMPUT LINGUIST, V45, P267, DOI [10.1162/coli_a_00348, 10.1162/COLI_a_00348]
[10]  
Fang HY, 2016, AAAI CONF ARTIF INTE, P122