User graph topic model

被引:3
作者
Akhtar, Nadeem [1 ]
Beg, M. M. Sufyan [1 ]
机构
[1] Aligarh Muslim Univ, Zakir Husain Coll Engn & Technol, Dept Comp Engn, Aligarh 202002, Uttar Pradesh, India
关键词
Topic models; Latent Dirichlet Allocation; user graph; SHORT TEXT; MICROBLOG;
D O I
10.3233/JIFS-169934
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Finding coherent topics in Twitter data is difficult task because of the sparseness and informal language. Tweets also provide rich contextual and auxiliary metadata which can be used to supervise the topic modeling to get more coherent topics. In this paper, a novel topic model is proposed which extends Author Topic Model for twitter. Standard Author Topic Model cannot be used on Twitter data as every tweet has exactly one author. The proposed User Graph Topic Model (UGTM) considers the semantic relationships among tweet users based on the contextual information like hashtags, user mentions and replies to make a user graph. Related users of author of a tweet are found and used in tweet generation process. Related user information from the user graph is used to obtain the dirichlet prior for user generation. Empirical results show that the proposed UGTM outperforms standard Author Topic Model (ATM) on experimental data.
引用
收藏
页码:2229 / 2240
页数:12
相关论文
共 48 条
[1]  
Akhtar N., 2017, APPL SOFT COMPUT, P83
[2]  
Akhtar N., 2018, CSI J COMPUTING, P19
[3]  
Akhtar N., 2018, DATA MANAGEMENT ANAL, V2, P21
[4]   Aspect based Sentiment Oriented Summarization of Hotel Reviews [J].
Akhtar, Nadeem ;
Zubair, Nashez ;
Kumar, Abhishek ;
Ahmad, Tameem .
7TH INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING & COMMUNICATIONS (ICACC-2017), 2017, 115 :563-571
[5]  
Alvarez-Melis D., 2016, Proceedings of the 10th International Conference on Web and Social Media, ICWSM 2016, P519
[6]  
[Anonymous], 2009, ARTIFICIAL INTELLIGE
[7]  
[Anonymous], 2008, ADV NEURAL INFORM PR
[8]  
[Anonymous], 2018, TWITTER STREAMING AP
[9]  
[Anonymous], 2008, Introduction to information retrieval
[10]  
[Anonymous], 2005, ENCY BIOSTATISTICS