Knowing the Tweeters: Deriving Sociologically Relevant Demographics from Twitter

Cited by: 106
Authors
Sloan, Luke [1]
Morgan, Jeffrey
Housley, William
Williams, Matthew
Edwards, Adam
Burnap, Pete
Rana, Omer
Affiliations
[1] Cardiff Univ, Sch Social Sci, Cardiff CF10 3AX, S Glam, Wales
Source
SOCIOLOGICAL RESEARCH ONLINE | 2013 / Vol. 18 / No. 03
Keywords
New Social Media; Demographics; Twitter; Social Media Analytics; Social Science; Sampling; GENDER IDENTIFICATION; PRIVACY; CRISIS;
DOI
10.5153/sro.3001
Chinese Library Classification number
C91 [Sociology];
Discipline classification code
030301 ; 1204 ;
Abstract
A perennial criticism regarding the use of social media in social science research is the lack of demographic information associated with naturally occurring mediated data such as that produced by Twitter. However, the fact that demographic information is not explicit does not mean that it is not implicitly present. Utilising the Cardiff Online Social Media Observatory (COSMOS), this paper suggests various techniques for establishing or estimating demographic data from a sample of more than 113 million Twitter users collected during July 2012. We discuss in detail the methods that can be used for identifying gender and language, and illustrate that the proportion of males and females using Twitter in the UK reflects the gender balance observed in the 2011 Census. We also expand on the three types of geographical information that can be derived from Tweets, either directly or by proxy, and on how spatial information can be used to link social media with official curated data. Whilst we make no grand claims about the representative nature of Twitter users in relation to the wider UK population, the derivation of demographic data demonstrates the potential of new social media (NSM) for the social sciences. We consider this paper a clarion call and hope that other researchers test the methods we suggest and develop them further.
Pages: 74-84
Number of pages: 11