User Modeling on Demographic Attributes in Big Mobile Social Networks

被引:23
作者
Dong, Yuxiao [1 ,2 ]
Chawla, Nitesh V. [1 ,2 ]
Tang, Jie [3 ]
Yang, Yang [4 ,5 ,6 ,7 ]
Yang, Yang [4 ,5 ,6 ,7 ]
机构
[1] Univ Notre Dame, Interdisciplinary Ctr Network Sci & Applicat iCeN, Notre Dame, IN 46556 USA
[2] Univ Notre Dame, Dept Comp Sci & Engn, Notre Dame, IN 46556 USA
[3] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
[4] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310027, Zhejiang, Peoples R China
[5] Northwestern Univ, Kellogg Sch Management, Evanston, IL 60208 USA
[6] Tsinghua Univ, Beijing, Peoples R China
[7] Univ Notre Dame, Notre Dame, IN 46556 USA
基金
美国国家科学基金会;
关键词
Gender and age; demographic prediction; node attributes; ego networks; social tie and triad; mobile communication; mobile phone data; computational social science; GRAPHS;
D O I
10.1145/3057278
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Users with demographic profiles in social networks offer the potential to understand the social principles that underpin our highly connected world, from individuals, to groups, to societies. In this article, we harness the power of network and data sciences to model the interplay between user demographics and social behavior and further study to what extent users' demographic profiles can be inferred from their mobile communication patterns. By modeling over 7 million users and 1 billion mobile communication records, we find that during the active dating period (i.e., 18-35 years old), users are active in broadening social connections with males and females alike, while after reaching 35 years of age people tend to keep small, closed, and same-gender social circles. Further, we formalize the demographic prediction problem of inferring users' gender and age simultaneously. We propose a factor graph-based WhoAmI method to address the problem by leveraging not only the correlations between network features and users' gender/age, but also the interrelations between gender and age. In addition, we identify a new problem-coupled network demographic prediction across multiple mobile operators- and present a coupled variant of the WhoAmI method to address its unique challenges. Our extensive experiments demonstrate the effectiveness, scalability, and applicability of the WhoAmI methods. Finally, our study finds a greater than 80% potential predictability for inferring users' gender from phone call behavior and 73% for users' age from text messaging interactions.
引用
收藏
页数:33
相关论文
共 59 条
[1]  
Aledavood T., 2015, CHANNEL SPECIFIC DAI
[2]  
[Anonymous], 2009, PNAS
[3]  
[Anonymous], 2011, P 17 ACM SIGKDD INT, DOI DOI 10.1145/2020408.2020581
[4]  
[Anonymous], 1999, P UAI 99
[5]  
[Anonymous], 2011, BASIC STAT ANAL
[6]  
[Anonymous], 2012, Networks, Crowds, and Markets
[7]  
[Anonymous], 2002, HIERARCHICAL LINEAR
[8]  
[Anonymous], 1954, FREEDOM CONTROL MODE
[9]  
[Anonymous], 2008, P 17 INT C WORLD WID, DOI DOI 10.1145/1367497.1367620
[10]  
Bi Bin, 2013, P 22 INT C WORLD WID, P131, DOI DOI 10.1145/2488388.2488401