Evaluating the Characteristics of Diagnostic Items for Bridging Errors in Multi-Digit Subtraction

被引:2
作者
Vermeulen, Jorine Adinda [1 ]
Beguin, Anton [2 ]
Scheltens, Floor [3 ]
Eggen, Theo J. H. M. [2 ,3 ]
机构
[1] Inholland Univ Appl Sci, Teacher Training Primary Educ, Fac Educ & Innovat, Rotterdam, Netherlands
[2] Cito, Arnhem, Netherlands
[3] Univ Twente, Dept Res Methodol Measurement & Data Anal, Fac Behav Sci, Enschede, Netherlands
关键词
diagnostic assessment; multi-digit subtraction; multiple-choice items; item characteristics; decomposition; subtraction errors; bridging errors; CONCEPTUAL STRUCTURES; STRATEGIES; NUMBERS;
D O I
10.3389/feduc.2020.537531
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
Subtraction errors can inform teachers about students' mathematical reasoning. Not every subtraction error is informative, it's implications for students' mathematical reasoning depends on the item characteristics. Diagnostic items are specifically designed to elicit specific subtraction errors. This study evaluated how the diagnostic capacity of subtraction items is related to their characteristics. The item characteristics being studied are open-ended and multiple-choice (MC) items, bare number, and word problems. As well as various number features, such as the number of digits in the subtrahend and minuend. Diagnostic capacity is defined as the extent to which multi-digit subtraction items that require borrowing (e.g., 1000-680) elicit bridging errors, such as the smaller-from-larger-error. Item response theory (IRT) was used to estimate item properties. Subsequently, the item properties were used in two separate ANOVA analyses to compare the diagnostic capacity of MC versus open-ended items, bare number versus word problems, and number features. As expected, MC items have a higher diagnostic capacity than open-ended items. More interestingly, it was found that the number of digits in the subtrahend and minuend influenced the diagnostic capacity of the items. Items characterized as 3/4n-3n, like 1000-680 had the highest diagnostic capacity, whereas items characterized as 3/4n-2n, such as 1000-20 had the lowest diagnostic capacity. The discussion focuses on the implications of this study for further research into the design of diagnostic items.
引用
收藏
页数:11
相关论文
共 44 条
[1]  
Alonzo A. C., 2009, LEARN PROGR SCI LEAP
[2]  
[Anonymous], 2017, AUSTR PRIM MATH CLAS
[3]  
Ashlock R., 2006, ERROR PATTERNS COMPU
[5]   Mental computation and conceptual understanding [J].
Blöte, AW ;
Klein, AS ;
Beishuizen, M .
LEARNING AND INSTRUCTION, 2000, 10 (03) :221-247
[6]   Diagnostic Assessment With Ordered Multiple-Choice Items [J].
Briggs, Derek C. ;
Alonzo, Alicia C. ;
Schwab, Cheryl ;
Wilson, Mark .
EDUCATIONAL ASSESSMENT, 2006, 11 (01) :33-63
[7]  
BROWN JS, 1980, COGNITIVE SCI, V4, P379, DOI 10.1016/S0364-0213(80)80010-3
[8]  
Eggen TJHM, 2011, PSICOLOGICA, V32, P107
[9]  
Fernandez R.L., 2008, Educational Research and Review, V3, P229
[10]   Children's conceptual structures for multidigit numbers and methods of multidigit addition and subtraction [J].
Fuson, KC ;
Wearne, D ;
Hiebert, JC ;
Murray, HG ;
Human, PG ;
Olivier, AI ;
Carpenter, TP ;
Fennema, E .
JOURNAL FOR RESEARCH IN MATHEMATICS EDUCATION, 1997, 28 (02) :130-162