Validity and reliability of measurement instruments used in research

Cited by: 780
Authors
Kimberlin, Carole L. [1]
Winterstein, Almut G. [1]
Affiliation
[1] Univ Florida, Coll Pharm, Dept Pharmaceut Outcomes & Policy, Gainesville, FL 32610 USA
Keywords
Control, quality; Data collection; Errors; Methodology; Research
DOI
10.2146/ajhp070364
Chinese Library Classification number
R9 [Pharmacy]
Discipline classification code
1007
Abstract
Purpose. Issues related to the validity and reliability of measurement instruments used in research are reviewed.

Summary. Key indicators of the quality of a measuring instrument are the reliability and validity of the measures. The process of developing and validating an instrument is in large part focused on reducing error in the measurement process. Reliability estimates evaluate the stability of measures, internal consistency of measurement instruments, and interrater reliability of instrument scores. Validity is the extent to which the interpretations of the results of a test are warranted, which depends on the particular use the test is intended to serve. The responsiveness of the measure to change is of interest in many of the applications in health care where improvement in outcomes as a result of treatment is a primary goal of research. Several issues may affect the accuracy of data collected, such as those related to self-report and secondary data sources. Self-report of patients or subjects is required for many of the measurements conducted in health care, but self-reports of behavior are particularly subject to problems with social desirability biases. Data that were originally gathered for a different purpose are often used to answer a research question, which can affect the applicability to the study at hand.

Conclusion. In health care and social science research, many of the variables of interest and outcomes that are important are abstract concepts known as theoretical constructs. Using tests or instruments that are valid and reliable to measure such constructs is a crucial component of research quality.
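
As an illustration of two of the reliability estimates named in the abstract, the following is a minimal Python sketch, not taken from the article. It assumes Cronbach's alpha as the internal-consistency estimate and Cohen's kappa as the interrater estimate for two raters assigning nominal categories; the function names and the sample data are hypothetical.

```python
from collections import Counter
from statistics import variance


def cronbach_alpha(scores):
    """Internal consistency for a list of respondents, each a list of k item scores."""
    k = len(scores[0])
    items = list(zip(*scores))  # regroup the data as one tuple per item
    item_var_sum = sum(variance(item) for item in items)
    total_var = variance([sum(row) for row in scores])  # variance of total scores
    return k / (k - 1) * (1 - item_var_sum / total_var)


def cohen_kappa(rater_a, rater_b):
    """Chance-corrected agreement between two raters on nominal categories."""
    n = len(rater_a)
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    counts_a, counts_b = Counter(rater_a), Counter(rater_b)
    # Agreement expected by chance, from each rater's marginal category counts.
    expected = sum(counts_a[c] * counts_b[c] for c in counts_a) / n**2
    return (observed - expected) / (1 - expected)


# Hypothetical data: three respondents answering two items, and two raters
# classifying four charts as "y" or "n".
alpha = cronbach_alpha([[1, 2], [2, 3], [3, 4]])
kappa = cohen_kappa(["y", "y", "n", "n"], ["y", "n", "n", "n"])
print(alpha, kappa)
```

Both functions use sample variances and assume complete data: no missing responses, at least two items and two respondents, and raters who do not agree purely by construction (an expected agreement of 1 would make kappa undefined).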
Pages: 2276-2284
Number of pages: 9