Measurement Practices in Large-Scale Replications: Insights From Many Labs 2

Cited by: 27
Authors
Shaw, Mairead [1]
Cloos, Leonie J. R. [2]
Luong, Raymond [1]
Elbaz, Sasha [3]
Flake, Jessica Kay [1]
Affiliations
[1] McGill Univ, Dept Psychol, 2001 McGill Coll, 7th Floor, Montreal, PQ H3A 1G1, Canada
[2] Leiden Univ, Dept Clin Psychol, Leiden, Netherlands
[3] Concordia Univ, Dept Psychol, Montreal, PQ, Canada
Source
CANADIAN PSYCHOLOGY-PSYCHOLOGIE CANADIENNE | 2020 / Vol. 61 / Issue 04
Keywords
measurement; replication; construct validity; measurement invariance; QUESTIONABLE RESEARCH PRACTICES; CONSTRUCT-VALIDATION; FIT INDEXES; REPLICABILITY; SENSITIVITY; INCENTIVES; VALIDITY; DISGUST; MODEL; TRUTH
DOI
10.1037/cap0000220
Chinese Library Classification (CLC) number
B84 [Psychology]
Discipline classification code
04; 0402
Abstract
Validity of measurement is integral to the interpretability of research endeavours and any subsequent replication attempts. To assess current measurement practices and the construct validity of measures in large-scale replication studies, we conducted a systematic review of measures used in "Many Labs 2: Investigating Variation in Replicability Across Samples and Settings" (Klein et al., 2018). To evaluate the psychometric properties of the scales used in "Many Labs 2," we conducted factor and reliability analyses on the publicly available data. We report that measures in "Many Labs 2" were often short with little validity evidence reported in the original study, that measures with more validity evidence in the original study had stronger psychometric properties in the replication sample, and that translated versions of scales had lower reliability. We discuss the implications of these findings for interpreting replication results, and make recommendations to improve measurement practices in future replications.
Pages: 289-298
Page count: 10