Detecting Social Topic by Hashtag-Weighted Topic Model over Time

Cited by: 0
Authors
Qiu, Jie [1]
Li, Li [1]
Affiliations
[1] Southwest Univ, Sch Comp & Informat Sci, Chongqing 400715, Peoples R China
Source
PROCEEDINGS OF THE 2016 4TH INTERNATIONAL CONFERENCE ON MACHINERY, MATERIALS AND INFORMATION TECHNOLOGY APPLICATIONS | 2016 / Vol. 71
Keywords
Hashtag-weighted; Topic model; Twitter
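DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract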
More and more social media platforms support hashtags to facilitate information classification. As on Twitter, a user-initiated hashtag can suggest emotion or mood and convey considerable extra information beyond the tweet itself. Hashtags have been widely used in topic analysis because they are informative, but not all hashtags are created equal. In this paper, we propose a Hashtag-Weighted Topic Model over Time (HWOT), which assigns different weights to hashtags in order to model how topics evolve over time. To leverage hashtags across topics within a specific time period, the topic of a hashtag is represented as a multinomial distribution and the topic over time as a Beta distribution. The model can uncover the latent relationships among topics, hashtags, and time. The weight of each hashtag is learned via a novel context-aware, weakly supervised approach. Experiments on a Twitter dataset show that the model achieves better performance in terms of model perplexity and further reveal how topics change over time.
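The abstract names two quantities without defining them: a Beta distribution over time for each topic, and perplexity as the evaluation metric. The sketch below illustrates both in general terms and is not the authors' implementation. It assumes timestamps are normalized to the open interval (0, 1), and the function names `beta_pdf` and `perplexity` are hypothetical helpers introduced only for this illustration.

```python
import math


def beta_pdf(t, a, b):
    """Density of a Beta(a, b) distribution at a normalized timestamp t in (0, 1)."""
    # Log of the normalizing constant Gamma(a + b) / (Gamma(a) * Gamma(b)).
    log_norm = math.lgamma(a + b) - math.lgamma(a) - math.lgamma(b)
    return math.exp(log_norm + (a - 1) * math.log(t) + (b - 1) * math.log(1 - t))


def perplexity(token_log_probs):
    """Perplexity: exp of the negative mean per-token natural-log likelihood."""
    return math.exp(-sum(token_log_probs) / len(token_log_probs))


# Illustrative (made-up) shape parameters: one topic concentrated early in the
# observation window, one concentrated late.
early_topic = (2.0, 5.0)
late_topic = (5.0, 2.0)
t = 0.2  # a timestamp near the start of the window
print(beta_pdf(t, *early_topic) > beta_pdf(t, *late_topic))
```

Under these assumptions, a topic whose Beta density is high at a tweet's timestamp is more plausible for that tweet, and a lower perplexity on held-out tokens indicates a better fit.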
Pages: 1033-1038
Page count: 6