Quantitative differences in retest effects across different methods used to construct alternate test forms

Cited by: 17
Authors
Arendasy, Martin E. [1 ]
Sommer, Markus [1 ]
Affiliations
[1] Graz Univ, A-8010 Graz, Austria
Keywords
Retest effect; Identical vs. alternate test forms; Cognitive ability; Automatic item generation; Criterion-related validity; Measurement invariance; Fit indexes; Measurement equivalence; Longitudinal data; Measurement bias; Ability tests; Test anxiety; Selection
DOI
10.1016/j.intell.2013.02.004
Chinese Library Classification number
B84 [Psychology]
Discipline classification code
04; 0402
Abstract
Allowing respondents to retake a cognitive ability test has been shown to increase their test scores. Several theoretical models have been proposed to explain this effect; they make distinct assumptions regarding the measurement invariance of psychometric tests across test administration sessions with regard to narrower cognitive abilities and general mental ability. We modeled retest effects in four psychometric tests as a function of specific retest form and general mental ability in order to compare the validity of these models and their generalizability across three different kinds of retest forms. To do so, automatic item generation was used to construct two kinds of alternate retest forms: (1) isomorphic retests and (2) psychometrically matched retests. A total of N = 358 respondents completed all four measures twice, receiving either identical retest forms, isomorphic retest forms, or psychometrically matched retest forms at the second test administration session. Item response theory modeling supported strict measurement invariance across all test forms and time-points of measurement but indicated variation in respondents' retest score gains due to individual differences in general mental ability and the kind of retest form used. In general, retest effects were more pronounced for high-g respondents, identical retests and isomorphic retest forms and for mental rotation and algebra word problems. Latent mean and covariance structure analyses indicated that retesting did not affect the g-factor saturation of the four cognitive ability tests but revealed that retest score gains were hollow with respect to psychometric g. © 2013 Elsevier Inc. All rights reserved.
Pages: 181-192
Number of pages: 12