Detecting global and local topics via mining twitter data

被引:12
作者
Liu, Huan [1 ]
Ge, Yong [2 ]
Zheng, Qinghua [1 ]
Lin, Rongcheng [3 ]
Li, Huayu [3 ]
机构
[1] Xi An Jiao Tong Univ, Dept Comp Sci & Technol, MOEKLINNS Lab, Xian, Shaanxi, Peoples R China
[2] Nanjing Univ Finance & Econ, Coll Informat Engn, Nanjing, Jiangsu, Peoples R China
[3] Univ North Carolina Charlotte, Dept Comp Sci, Charlotte, NC USA
基金
中国国家自然科学基金;
关键词
Social event; Probabilistic graphical model; Twitter; Global and local topic; EVENT DETECTION;
D O I
10.1016/j.neucom.2017.07.056
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Detecting topics from Twitter has been widely studied for understanding social events. There are two types of topics, i.e., global topics attracting widespread tweets with larger volume and local topics drawing attention of limited tweets of somewhere. However, most of existent works neglect the difference between them and suffer from the Long Tail Effect, resulting in the inability to detect the local one. In this paper, we distinguish global and local topics by associating each tweet with both of them simultaneously. We propose a probabilistic graphical model to extract global and local topics related to social events in a unified framework at the same time. Our model learns global topics using tweets scattered around all locations, while studies local topics merely utilizing tweets within the corresponding location. We collect two tweet datasets on Twitter from several cities in USA and evaluate our model over them. The experimental results show significant improvement of our model compared to baseline methods. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:120 / 132
页数:13
相关论文
共 34 条
[1]  
[Anonymous], P 10 INT WORKSHOP MU, DOI [DOI 10.1145/1814245.1814249, 10.1145/1814245.1814249]
[2]  
[Anonymous], 2008, CROWDSOURCING POWER
[3]  
[Anonymous], 2011, P INT AAAI C WEB SOC
[4]  
[Anonymous], 2012, P 2012 SIAM INT C DA
[5]  
[Anonymous], 2011, Proceedings of the 17th ACM SIGKDD Interna- tional Conference on Knowledge Discovery and Data Mining
[6]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[7]  
Chang XJ, 2015, AAAI CONF ARTIF INTE, P2532
[8]   Bi-Level Semantic Representation Analysis for Multimedia Event Detection [J].
Chang, Xiaojun ;
Ma, Zhigang ;
Yang, Yi ;
Zeng, Zhiqiang ;
Hauptmann, Alexander G. .
IEEE TRANSACTIONS ON CYBERNETICS, 2017, 47 (05) :1180-1197
[9]   Non-Parametric Scan Statistics for Event Detection and Forecasting in Heterogeneous Social Media Graphs [J].
Chen, Feng ;
Neill, Daniel B. .
PROCEEDINGS OF THE 20TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'14), 2014, :1166-1175
[10]   Road Traffic Congestion Monitoring in Social Media with Hinge-Loss Markov Random Fields [J].
Chen, Po-Ta ;
Chen, Feng ;
Qian, Zhen .
2014 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2014, :80-89