Data Collection in a Flat World: The Strengths and Weaknesses of Mechanical Turk Samples

被引:1606
作者
Goodman, Joseph K. [1 ]
Cryder, Cynthia E. [1 ]
Cheema, Amar [2 ]
机构
[1] Washington Univ, Olin Business Sch, St Louis, MO 63130 USA
[2] Univ Virginia, McIntire Sch Commerce, Charlottesville, VA USA
关键词
research methods; surveys; sampling; online research; external validity; PROSPECT-THEORY; SELF-ESTEEM; DECISIONS; VALIDATION; FRAMES; SCALE;
D O I
10.1002/bdm.1753
中图分类号
B849 [应用心理学];
学科分类号
040203 ;
摘要
Mechanical Turk (MTurk), an online labor system run by Amazon.com, provides quick, easy, and inexpensive access to online research participants. As use of MTurk has grown, so have questions from behavioral researchers about its participants, reliability, and low compensation. In this article, we review recent research about MTurk and compare MTurk participants with community and student samples on a set of personality dimensions and classic decision-making biases. Across two studies, we find many similarities between MTurk participants and traditional samples, but we also find important differences. For instance, MTurk participants are less likely to pay attention to experimental materials, reducing statistical power. They are more likely to use the Internet to find answers, even with no incentive for correct responses. MTurk participants have attitudes about money that are different from a community sample's attitudes but similar to students' attitudes. Finally, MTurk participants are less extraverted and have lower self-esteem than other participants, presenting challenges for some research domains. Despite these differences, MTurk participants produce reliable results consistent with standard decision-making biases: they are present biased, risk-averse for gains, risk-seeking for losses, show delay/expedite asymmetries, and show the certainty effectwith almost no significant differences in effect sizes from other samples. We conclude that MTurk offers a highly valuable opportunity for data collection and recommend that researchers using MTurk (1) include screening questions that gauge attention and language comprehension; (2) avoid questions with factual answers; and (3) consider how individual differences in financial and social domains may influence results. Copyright (c) 2012 John Wiley & Sons, Ltd.
引用
收藏
页码:213 / 224
页数:12
相关论文
共 39 条
[1]  
[Anonymous], TIME VERSUS MO UNPUB
[2]  
[Anonymous], 2010, 1001 CEDER NEW YORK
[3]  
[Anonymous], J THEORETICAL BIOL
[4]   Small feedback-based decisions and their limited correspondence to description-based decisions [J].
Barron, G ;
Erev, I .
JOURNAL OF BEHAVIORAL DECISION MAKING, 2003, 16 (03) :215-233
[5]   Amazon's Mechanical Turk: A New Source of Inexpensive, Yet High-Quality, Data? [J].
Buhrmester, Michael ;
Kwang, Tracy ;
Gosling, Samuel D. .
PERSPECTIVES ON PSYCHOLOGICAL SCIENCE, 2011, 6 (01) :3-5
[6]  
Chatterjee S, 2000, J BEHAV DECIS MAKING, V13, P61, DOI 10.1002/(SICI)1099-0771(200001/03)13:1<61::AID-BDM343>3.0.CO
[7]  
2-J
[8]  
Eriksson K, 2010, JUDGM DECIS MAK, V5, P159
[9]   Cognitive reflection and decision making [J].
Frederick, S .
JOURNAL OF ECONOMIC PERSPECTIVES, 2005, 19 (04) :25-42
[10]  
Friedman T., 2005, The World is Flat