Measurement issues in causal inference

被引:1
作者
Shear, Benjamin R. [1 ]
Briggs, Derek C. [1 ]
机构
[1] Univ Colorado Boulder, Sch Educ, 249 UCB, Boulder, CO 80309 USA
关键词
Validity; Reliability; Measurement; Causal inference; MEASUREMENT ERROR; STATISTICS; VALIDATION; ANCOVA;
D O I
10.1007/s12564-024-09942-9
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
Research in the social and behavioral sciences relies on a wide range of experimental and quasi-experimental designs to estimate the causal effects of specific programs, policies, and events. In this paper we highlight measurement issues relevant to evaluating the validity of causal estimation and generalization. These issues impact all four categories of threats to validity previously delineated by Shadish et al. (Experimental and quasi-experimental designs for generalized causal inference. Houghton Mifflin, Boston, 2002): internal, external, statistical conclusion, and construct validity. We use the context of estimating the effect of the COVID-19 pandemic on student learning in the U.S. to illustrate the important role of measurement in causal inference. We provide background related to the meaning of measurement, and focus attention on the evidence and argumentation necessary to evaluate the validity and reliability of the different types of measures used in statistical models for causal inference. We conclude with recommendations for researchers estimating and generalizing causal effects: provide clear statements for construct interpretations, seek to rule out potential sources of construct-irrelevant variance, quantify and adjust for measurement error, and consider the extent to which interpretations of practical significance are consistent with scale properties of outcome measures.
引用
收藏
页码:719 / 731
页数:13
相关论文
共 84 条
  • [1] American Educational Research Association Psychological Association National Council on Measurement in Education, 2014, STAND ED PSYCH TEST
  • [2] Bloom H.S., 2008, J RES EDUC EFF, V1, P289, DOI [https://doi.org/10.1080/19345740802400072, DOI 10.1080/19345740802400072]
  • [3] Bollen K.A., 1989, STRUCTURAL EQUATIONS, DOI [DOI 10.1002/9781118619179, https://doi.org/10.1002/9781118619179]
  • [4] The use of test scores from large-scale assessment surveys: psychometric and statistical considerations
    Braun H.
    von Davier M.
    [J]. Large-scale Assessments in Education, 5 (1)
  • [5] Brennan RL, 2001, Generalizability Theory, DOI DOI 10.1007/978-1-4757-3456-0
  • [6] Briggs D.C., 2021, Historical and conceptual foundations of measurement in the human sciences: Credos and controversies, V1st, DOI [DOI 10.1201/9780429275326, https://doi.org/10.1201/9780429275326]
  • [7] Briggs D.C., 2021, HIST ED MEASUREMENT
  • [8] Briggs D. C., IN PRESS
  • [9] The Gains From Vertical Scaling
    Briggs, Derek C.
    Domingue, Ben
    [J]. JOURNAL OF EDUCATIONAL AND BEHAVIORAL STATISTICS, 2013, 38 (06) : 551 - 576
  • [10] Buonaccorsi JP, 2010, INTERD STAT, P1, DOI 10.1201/9781420066586