Multi-perspective Hierarchical Dirichlet Process for Geographical Topic Modeling

被引:4
作者
He, Yuan [1 ]
Wang, Cheng [1 ]
Jian, Changjun [1 ]
机构
[1] Tongji Univ, Dept Comp Sci, Shanghai, Peoples R China
来源
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2017, PT I | 2017年 / 10234卷
关键词
Hierarchical Dirichlet Process; Geographical topic modeling;
D O I
10.1007/978-3-319-57454-7_63
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The pervasion of location acquisition technology has strongly propelled the popularity of geo-tagged user-generated content (UGC), which also raises new computational possibility for investigating geographical topics and users' spatial behaviors. This paper proposes a novel method for geographical topic modeling by combining text content with user information and spatial knowledge. Topics are estimated as the interests of users and features of locations. The joint modeling of the three heterogeneous sources (1) leads to high accuracy in predicting visit behaviors driven by personal interests, (2) discovers coherent topic representations for topic modeling, (3) enables the recommender system to suggest interpretable locations. Our framework is flexible to incorporate new dimensions of data such as temporal information without substantially changing the model structure. We also experimentally demonstrate the limitations of the traditional assumption that a topic is selected considerably dependent on the location. In many cases, the published topics are mainly affected by the user's interests rather than the current location. Our model discriminates these two scenarios. Through employing hierarchical Dirichlet process, we also need not predefine the number of topics like other mixture models. Experiments on three different datasets show that our model is effective in discovering spatial topics and significantly outperforms the state of the art.
引用
收藏
页码:811 / 823
页数:13
相关论文
共 26 条
[1]  
[Anonymous], 2012, P 18 ACM SIGKDD INT, DOI 10.1145/2339530.2339562
[2]  
[Anonymous], 2010, EMNLP
[3]  
[Anonymous], 2013, P 23 INT JOINT C ART
[4]  
[Anonymous], 2010, BAYESIAN VIEW POISSO
[5]  
[Anonymous], 2013, RECSYS 13 P 7 ACM C, DOI DOI 10.1145/2507157.2507174
[6]   The Nested Chinese Restaurant Process and Bayesian Nonparametric Inference of Topic Hierarchies [J].
Blei, David M. ;
Griffiths, Thomas L. ;
Jordan, Michael I. .
JOURNAL OF THE ACM, 2010, 57 (02)
[7]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[8]  
Chang J., 2009, ADV NEURAL INFORM PR, P288, DOI DOI 10.5555/2984093.2984126
[9]   Discovering Coherent Topics Using General Knowledge [J].
Chen, Zhiyuan ;
Mukherjee, Arjun ;
Liu, Bing ;
Hsu, Meichun ;
Castellanos, Malu ;
Ghosh, Riddhiman .
PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, :209-218
[10]   BAYESIAN ANALYSIS OF SOME NONPARAMETRIC PROBLEMS [J].
FERGUSON, TS .
ANNALS OF STATISTICS, 1973, 1 (02) :209-230