Understanding the User Behavior of Foursquare: A Data-Driven Study on a Global Scale

被引:9
作者
Chen, Yang [1 ,2 ,3 ]
Hu, Jiyao [1 ,2 ,3 ]
Xiao, Yu [4 ]
Li, Xiang [5 ]
Hui, Pan [6 ,7 ]
机构
[1] Fudan Univ, Sch Comp Sci, Shanghai 200433, Peoples R China
[2] Fudan Univ, Shanghai Key Lab Intelligent Informat Proc, Shanghai 200433, Peoples R China
[3] Peng Cheng Lab, Shenzhen 518066, Peoples R China
[4] Aalto Univ, Dept Commun & Networking, Espoo 02150, Finland
[5] Fudan Univ, Sch Informat Sci & Technol, Shanghai 200433, Peoples R China
[6] Univ Helsinki, Dept Comp Sci, Helsinki 00100, Finland
[7] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Hong Kong, Peoples R China
基金
中国国家自然科学基金; 芬兰科学院;
关键词
Twitter; Publishing; Distributed databases; Sociology; Statistics; Systematics; IEEE Foundation; Data-driven study; location-based social networks (LBSNs); machine learning; social graph analysis; social influence; tips; ONLINE SOCIAL NETWORKS; POPULARITY; POWER;
D O I
10.1109/TCSS.2020.2992294
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Being a leading online service providing both local search and social networking functions, Foursquare has attracted tens of millions of users all over the world. Understanding the user behavior of Foursquare is helpful to gain insights for location-based social networks (LBSNs). Most of the existing studies focus on a biased subset of users, which cannot give a representative view of the global user base. Meanwhile, although the user-generated content (UGC) is very important to reflect user behavior, most of the existing UGC studies of Foursquare are based on the check-ins. There is a lack of a thorough study on tips, the primary type of UGC on Foursquare. In this article, by crawling and analyzing the global social graph and all published tips, we conduct the first comprehensive user behavior study of all 60+ million Foursquare users around the world. We have made the following three main contributions. First, we have found several unique and undiscovered features of the Foursquare social graph on a global scale, including a moderate level of reciprocity, a small average clustering coefficient, a giant strongly connected component, and a significant community structure. Besides the singletons, most of the Foursquare users are weakly connected with each other. Second, we undertake a thorough investigation according to all published tips on Foursquare. We start from counting the numbers of tips published by different users and then look into the tip contents from the perspectives of tip venues, temporal patterns, and sentiment. Our results provide an informative picture of the tip publishing patterns of Foursquare users. Last but not least, as a practical scenario to help third-party application providers, we propose a supervised machine learning-based approach to predict whether a user is an influential by referring to the profile and UGC, instead of relying on the social connectivity information. Our data-driven evaluation demonstrates that our approach can reach a good prediction performance with an F1-score of 0.87 and an AUC value of 0.88. Our findings provide a systematic view of the behavior of Foursquare users and are constructive for different relevant entities, including LBSN service providers, Internet service providers, and third-party application providers.
引用
收藏
页码:1019 / 1032
页数:14
相关论文
共 70 条
  • [61] Wang G., 2016, P IEEE INT C RAD, P1
  • [62] Watanabe M, 2013, PROCEEDINGS OF THE 22ND INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'13 COMPANION), P531
  • [63] Weng J., 2010, P 3 ACM INT C WEB SE, P261
  • [64] Wilson C, 2009, EUROSYS'09: PROCEEDINGS OF THE FOURTH EUROSYS CONFERENCE, P205
  • [65] We Know Your Preferences an New Cities: Mining and Modeling the Behavior of Travelers
    Xie, Rong
    Chen, Yang
    Xie, Qinge
    Xiao, Yu
    Wang, Xin
    [J]. IEEE COMMUNICATIONS MAGAZINE, 2018, 56 (11) : 28 - 35
  • [66] Context-aware Real-time Population Estimation for Metropolis
    Xu, Fengli
    Feng, Jie
    Zhang, Pengyu
    Li, Yong
    [J]. UBICOMP'16: PROCEEDINGS OF THE 2016 ACM INTERNATIONAL JOINT CONFERENCE ON PERVASIVE AND UBIQUITOUS COMPUTING, 2016, : 1064 - 1075
  • [67] Xu TY, 2011, LECT NOTES COMPUT SC, V7049, P20
  • [68] Yang C., 2018, BIG DATA MINING ANAL, V1, P284
  • [69] Zhao X., 2012, P ACM C INT MEAS C I, P171
  • [70] JPR: Exploring Joint Partitioning and Replication for Traffic Minimization in Online Social Networks
    Zhou, Jingya
    Fan, Jianxi
    [J]. 2017 IEEE 37TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2017), 2017, : 1147 - 1156