Gaining Insights From Social Media Language: Methodologies and Challenges

被引:133
作者
Kern, Margaret L. [1 ]
Park, Gregory [2 ]
Eichstaedt, Johannes C. [2 ]
Schwartz, H. Andrew [3 ,4 ]
Sap, Maarten [2 ]
Smith, Laura K. [2 ]
Ungar, Lyle H. [3 ]
机构
[1] Univ Melbourne, Melbourne Grad Sch Educ, 100 Leicester St,Level 2, Parkville, Vic 3010, Australia
[2] Univ Penn, Dept Psychol, Philadelphia, PA 19104 USA
[3] Univ Penn, Dept Comp & Informat Sci, Philadelphia, PA 19104 USA
[4] SUNY Stony Brook, Dept Comp Sci, Stony Brook, NY USA
关键词
social media; linguistic analysis; interdisciplinary collaboration; online behavior; computational social science; LATENT SEMANTIC ANALYSIS; REGRESSION; PERSONALITY; ALGORITHM; SELECTION; NETWORK; SCIENCE; HEALTH; USERS; TEXT;
D O I
10.1037/met0000091
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
Language data available through social media provide opportunities to study people at an unprecedented scale. However, little guidance is available to psychologists who want to enter this area of research. Drawing on tools and techniques developed in natural language processing, we first introduce psychologists to social media language research, identifying descriptive and predictive analyses that language data allow. Second, we describe how raw language data can be accessed and quantified for inclusion in subsequent analyses, exploring personality as expressed on Facebook to illustrate. Third, we highlight challenges and issues to be considered, including accessing and processing the data, interpreting effects, and ethical issues. Social media has become a valuable part of social life, and there is much we can learn by bringing together the tools of computer science with the theories and insights of psychology.
引用
收藏
页码:507 / 525
页数:19
相关论文
共 123 条
[41]  
Friedman HowardS., 2011, LONGEVITY PROJECT SU
[42]  
GILL A, 2004, THESIS
[43]   The international personality item pool and the future of public-domain personality measures [J].
Goldberg, LR ;
Johnson, JA ;
Eber, HW ;
Hogan, R ;
Ashton, MC ;
Cloninger, CR ;
Gough, HG .
JOURNAL OF RESEARCH IN PERSONALITY, 2006, 40 (01) :84-96
[44]   Topics in semantic representation [J].
Griffiths, Thomas L. ;
Steyvers, Mark ;
Tenenbaum, Joshua B. .
PSYCHOLOGICAL REVIEW, 2007, 114 (02) :211-244
[45]   Text as Data: The Promise and Pitfalls of Automatic Content Analysis Methods for Political Texts [J].
Grimmer, Justin ;
Stewart, Brandon M. .
POLITICAL ANALYSIS, 2013, 21 (03) :267-297
[46]  
Grossman DavidA., 2012, Information Retrieval: Algorithms and Heuristics, V15
[47]   Discrete affects across the adult lifespan: Evidence for multidimensionality and multidirectionality of affective experiences in young, middle-aged and older adults [J].
Gruehn, Daniel ;
Kotter-Gruehn, Dana ;
Roecke, Christina .
JOURNAL OF RESEARCH IN PERSONALITY, 2010, 44 (04) :492-500
[48]  
Guyon I., 2003, J Mach Learn Res, V3, P1157, DOI DOI 10.5555/944919.944968
[49]   Childhood Conscientiousness Relates to Objectively Measured Adult Physical Health Four Decades Later [J].
Hampson, Sarah E. ;
Edmonds, Grant W. ;
Goldberg, Lewis R. ;
Dubanoski, Joan P. ;
Hillier, Teresa A. .
HEALTH PSYCHOLOGY, 2013, 32 (08) :925-928
[50]  
Han B., 2012, P COLING 2012, P1045