Learning to suggest questions in social media

被引:9
作者
Zhou, Tom Chao [1 ]
Lyu, Michael Rung-Tsong [2 ,3 ]
King, Irwin [2 ,3 ]
Lou, Jie [4 ]
机构
[1] Baidu Inc, Shenzhen, Peoples R China
[2] Chinese Univ Hong Kong, Shenzhen Key Lab Rich Media Big Data Analyt & App, Shenzhen Res Inst, Shenzhen, Peoples R China
[3] Chinese Univ Hong Kong, Dept Comp Sci & Engn, Shatin, Hong Kong, Peoples R China
[4] City Univ Hong Kong, Dept Informat Syst, Kowloon Tong, Hong Kong, Peoples R China
关键词
Social media; Online forum; Community-based Q&A; Question suggestion; Language model; Topic modeling; Q-AND-A; KNOWLEDGE; ANSWERS; MODELS;
D O I
10.1007/s10115-014-0737-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Social media systems with Q&A functionalities have accumulated large archives of questions and answers. Two representative types are online forums and community-based Q&A services. To enable users to explore the large number of questions and answers in social media systems effectively, it is essential to suggest interesting items to an active user. In this article, we address the problem of question suggestion, which targets at suggesting questions that are semantically related to a queried question. Existing bag-of-words approaches suffer from the shortcoming that they could not bridge the lexical chasm between semantically related questions. Therefore, we present a new framework, and propose the topic-enhanced translation-based language model (TopicTRLM), which fuses both the lexical and latent semantic knowledge. This fusing enables TopicTRLM to find semantically related questions to a given question even when there is little word overlap. Moreover, to incorporate the answer information into the model to make the model more complete, we also propose the topic-enhanced translation-based language model with answer ensemble. Extensive experiments have been conducted with real-world datasets. Experimental results indicate our approach is very effective and outperforms other popular methods in several metrics.
引用
收藏
页码:389 / 416
页数:28
相关论文
共 72 条
[1]  
Agichtein E, 2001, P 10 INT C WORLD WID
[2]   Modeling Information-Seeker Satisfaction in Community Question Answering [J].
Agichtein, Eugene ;
Liu, Yandong ;
Bian, Jiang .
ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2009, 3 (02)
[3]  
[Anonymous], 2005, PARAMETER ESTIMATION
[4]  
[Anonymous], P 17 INT C WORLD WID
[5]  
[Anonymous], P 14 ACM INT C INF K
[6]  
[Anonymous], P 21 ANN INT ACM SIG
[7]  
[Anonymous], 2008, P WWW
[8]  
[Anonymous], 2003, ICML
[9]  
[Anonymous], 2000, P 23 ANN INT ACM SIG
[10]  
[Anonymous], P 29 ANN INT ACM SIG