Understanding demographic and socioeconomic biases of geotagged Twitter users at the county level

被引:82
作者
Jiang, Yuqin [1 ]
Li, Zhenlong [1 ]
Ye, Xinyue [2 ]
机构
[1] Univ South Carolina, Dept Geog, Columbia, SC 29208 USA
[2] Kent State Univ, Dept Geog, Kent, OH 44240 USA
关键词
Social media; population bias; geographically weighted regression; spatial statistics; big data; GEOGRAPHICALLY WEIGHTED REGRESSION; SOCIAL MEDIA;
D O I
10.1080/15230406.2018.1434834
中图分类号
P9 [自然地理学]; K9 [地理];
学科分类号
0705 ; 070501 ;
摘要
Massive social media data produced from microblog platforms provide a new data source for studying human dynamics at an unprecedented scale. Meanwhile, population bias in geotagged Twitter users is widely recognized. Understanding the demographic and socioeconomic biases of Twitter users is critical for making reliable inferences on the attitudes and behaviors of the population. However, the existing global models cannot capture the regional variations of the demographic and socioeconomic biases. To bridge the gap, we modeled the relationships between different demographic/socioeconomic factors and geotagged Twitter users for the whole contiguous United States, aiming to understand how the demographic and socioeconomic factors relate to the number of Twitter users at county level. To effectively identify the local Twitter users for each county of the United States, we integrate three commonly used methods and develop a query approach in a high-performance computing environment. The results demonstrate that we can not only identify how the demographic and socioeconomic factors relate to the number of Twitter users, but can also measure and map how the influence of these factors vary across counties.
引用
收藏
页码:228 / 242
页数:15
相关论文
共 47 条
[1]  
[Anonymous], 2015, ISPRS INT J GEO-INF, DOI DOI 10.3390/JIGI4031549
[2]   Geographically weighted regression - modelling spatial non-stationarity [J].
Brunsdon, C ;
Fotheringham, S ;
Charlton, M .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES D-THE STATISTICIAN, 1998, 47 :431-443
[3]   Geographically weighted regression: A method for exploring spatial nonstationarity [J].
Brunsdon, C ;
Fotheringham, AS ;
Charlton, ME .
GEOGRAPHICAL ANALYSIS, 1996, 28 (04) :281-298
[4]  
Charlton M., 2009, Geographically Weighted Regression
[5]  
Cheng Z., 2010, CIKM 10, P759, DOI DOI 10.1145/1871437.1871535
[6]  
Chu Z, 2010, 26TH ANNUAL COMPUTER SECURITY APPLICATIONS CONFERENCE (ACSAC 2010), P21
[7]  
Dickerson JP, 2014, 2014 PROCEEDINGS OF THE IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM 2014), P620, DOI 10.1109/ASONAM.2014.6921650
[8]  
ESRI, 2017, INT OLS RES
[9]  
Ester M., 1996, P 2 INT C KNOWL DISC
[10]   Crowdsourcing Dialect Characterization through Twitter [J].
Goncalves, Bruno ;
Sanchez, David .
PLOS ONE, 2014, 9 (11)