Regions, Periods, Activities: Uncovering Urban Dynamics via Cross-Modal Representation Learning

被引:101
作者
Zhang, Chao [1 ]
Zhang, Keyang [1 ]
Yuan, Quan [1 ]
Peng, Haoruo [1 ]
Zheng, Yu [2 ]
Hanratty, Tim [3 ]
Wang, Shaowen [4 ,5 ]
Han, Jiawei [1 ]
机构
[1] Univ Illinois, Dept Comp Sci, Urbana, IL 61801 USA
[2] Microsoft Res, Beijing, Peoples R China
[3] US Army Res Lab, Adelphi, MD USA
[4] Univ Illinois, Dept Geog, Urbana, IL USA
[5] Univ Illinois, GIS, Urbana, IL USA
来源
PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'17) | 2017年
基金
美国国家科学基金会;
关键词
Twitter; urban dynamics; activity; representation learning; social media; spatiotemporal data; geographical topic;
D O I
10.1145/3038912.3052601
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the ever-increasing urbanization process, systematically modeling people's activities in the urban space is being recognized as a crucial socioeconomic task. This task was nearly impossible years ago due to the lack of reliable data sources, yet the emergence of geo-tagged social media (GTSM) data sheds new light on it. Recently, there have been fruitful studies on discovering geographical topics from GTSM data. However, their high computational costs and strong distributional assumptions about the latent topics hinder them from fully unleashing the power of GTSM. To bridge the gap, we present CrossMap, a novel cross-modal representation learning method that uncovers urban dynamics with massive GTSM data. CrossMap first employs an accelerated mode seeking procedure to detect spatiotemporal hotspots underlying people's activities. Those detected hotspots not only address spatiotemporal variations, but also largely alleviate the sparsity of the GTSM data. With the detected hotspots, CrossMap then jointly embeds all spatial, temporal, and textual units into the same space using two different strategies: one is reconstruction-based and the other is graph-based. Both strategies capture the correlations among the units by encoding their co-occurrence and neighborhood relationships, and learn low-dimensional representations to preserve such correlations. Our experiments demonstrate that CrossMap not only significantly outperforms state-of-the-art methods for activity recovery and classification, but also achieves much better efficiency.
引用
收藏
页码:361 / 370
页数:10
相关论文
共 42 条
[1]   EvenTweet: Online Localized Event Detection from Twitter [J].
Abdelhaq, Flamed ;
Sengstock, Christian ;
Gertz, Michael .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2013, 6 (12) :1326-1329
[2]  
[Anonymous], 2011, Power Electronics: Power Electronic Conversion and Control Technology
[3]  
[Anonymous], 2011, P INT AAAI C WEB SOC
[4]  
[Anonymous], 1970, UCLA WORK PAP PHONET
[5]  
[Anonymous], 2006, P 15 INT C WORLD WID
[6]  
[Anonymous], 2014, PROC 20 ACM SIGKDD, DOI DOI 10.1145/2623330.2623732
[7]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[8]  
Carreira-Perpinan M.A., 2006, IEEE Comp Soc Conf Comp Vis Patt Recog, P1160, DOI DOI 10.1109/CVPR.2006.44
[9]  
Chen L., 2009, CIKM, P523
[10]  
Cho E., 2011, P 17 ACM SIGKDD INT, P1082, DOI [10.1145/2020408.2020579, DOI 10.1145/2020408.2020579]