What sample sizes for reliability and validity studies in neurology?

Cited by: 140
Authors
Hobart, Jeremy C. [1, 2]
Cano, Stefan J. [2]
Warner, Thomas T. [3]
Thompson, Alan J. [4]
Affiliations
[1] Peninsula Coll Med & Dent, Dept Clin Neurosci, Plymouth PL6 8BX, Devon, England
[2] Peninsula Coll Med & Dent, Clin Neurol Res Grp, Plymouth PL6 8BX, Devon, England
[3] UCL Inst Neurol, Dept Clin Neurosci, London, England
[4] UCL Inst Neurol, Dept Brain Repair & Rehabil, London, England
Keywords
Multiple sclerosis; Cervical dystonia; Reliability; Validity; Sample size; FORM HEALTH SURVEY; IMPACT SCALE MSIS-29; QUALITY-OF-LIFE; COEFFICIENT ALPHA; SURVEY SF-36; REQUIREMENTS; DYSTONIA; CDIP-58; ERROR; TESTS;
DOI
10.1007/s00415-012-6570-y
Chinese Library Classification number
R74 [Neurology and Psychiatry];
Abstract
Rating scales are increasingly used in neurologic research and trials. A key question relating to their use across the range of neurologic diseases, both common and rare, is what sample sizes provide meaningful estimates of reliability and validity. Here, we address two questions: (1) to what extent does sample size influence the stability of reliability and validity estimates; and (2) to what extent does sample size influence the inferences made from reliability and validity testing? We examined data from two studies. In Study 1, we retrospectively reduced the total sample randomly and nonrandomly by decrements of approximately 50% to generate sub-samples from n = 713 down to n = 20. In Study 2, we prospectively generated sub-samples from n = 20 to n = 320, by entry time into the study. In all samples we estimated reliability (internal consistency, item-total correlations, test-retest) and validity (within-scale correlations, convergent and discriminant construct validity). Reliability estimates were stable in magnitude and interpretation in all sub-samples of both studies. Validity estimates were stable in samples of n ≥ 80, for 75% of scales in samples of n = 40, and for 50% of scales in samples of n = 20. In this study, sample sizes of a minimum of 20 for reliability and 80 for validity provided estimates highly representative of the main study samples. These findings should be considered provisional and more work is needed to determine if these estimates are generalisable, consistent, and useful.
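The halving procedure described for Study 1 can be illustrated with a minimal sketch. It is not the authors' analysis: the data are simulated, the function names (`cronbach_alpha`, `halving_subsamples`, `simulate`) are hypothetical, only random nested halving is shown (the abstract also mentions nonrandom reduction), and only internal consistency (Cronbach's alpha) is estimated.

```python
import random
from statistics import variance


def cronbach_alpha(rows):
    """Cronbach's alpha for a list of respondents, each a list of item scores."""
    k = len(rows[0])  # number of items in the scale
    item_vars = [variance([r[i] for r in rows]) for i in range(k)]
    total_var = variance([sum(r) for r in rows])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


def halving_subsamples(rows, min_n=20, seed=0):
    """Return the full sample followed by random nested halves down to min_n.

    Assumption: each sub-sample is drawn at random from the previous one.
    """
    rng = random.Random(seed)
    current = list(rows)
    out = [current]
    while len(current) // 2 >= min_n:
        current = rng.sample(current, len(current) // 2)
        out.append(current)
    return out


def simulate(n, k=10, seed=1):
    """Simulated scale data (an assumption): each item = latent trait + noise."""
    rng = random.Random(seed)
    rows = []
    for _ in range(n):
        trait = rng.gauss(0, 1)
        rows.append([trait + rng.gauss(0, 1) for _ in range(k)])
    return rows


data = simulate(713)  # 713 matches the largest Study 1 sample size
results = [(len(s), cronbach_alpha(s)) for s in halving_subsamples(data)]
for n, alpha in results:
    print(f"n = {n:3d}  alpha = {alpha:.3f}")
```

Comparing the alpha values across the rows gives a rough sense of how stable a reliability estimate stays as the sample shrinks, which is the kind of comparison the abstract reports.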
Pages: 2681-2694
Number of pages: 14