Development and validation of immediate self-feedback very short answer questions for medical students: practical implementation of generalizability theory to estimate reliability in formative examination designs

被引:6
作者
Lertsakulbunlue, Sethapong [1 ]
Kantiwong, Anupong [1 ]
机构
[1] Phramongkutklao Coll Med, Dept Pharmacol, Bangkok 10400, Thailand
关键词
Formative examination; Self-assessment; Immediate feedback; VSAQ; Generalizability theory; Medical Student; METAANALYSIS COMPARING PEER; GUIDE; VALIDITY;
D O I
10.1186/s12909-024-05569-x
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
Background Very Short Answer Questions (VSAQs) reduce cueing and simulate better real-clinical practice compared with multiple-choice questions (MCQs). While integrating them into formative exams has potential, addressing marking time and ideal occasions and items is crucial. This study gathers validity evidence of novel immediate self-feedback VSAQ (ISF-VSAQ) format and determines the optimal number of items and occasions for reliable assessment.Methods Ninety-four third-year pre-clinical students took two ten-item ISF-VSAQ exams on cardiovascular drugs. Each question comprised two sections: (1) Questions with space for student responses and (2) a list of possible correct answers offering partial-credit scores ranging from 0.00 to 1.00, along with self-marking and self-feedback options to indicate whether they fully, partially, or did not understand the possible answers. Messick's validity framework guided the collection of validity evidence.Results Validity evidence included five sources: (1) Content: The expert reviewed the ISF-VSAQ format, and the question was aligned with a standard examination blueprint. (2) Response process: Before starting, students received an example and guide to the ISF-VSAQ, and the teacher detailed the steps in the initial session to aid self-assessment. Unexpected answers were comprehensively reviewed by experts. (3) Internal structure: The Cronbach alphas are good for both occasions (>= 0.70). A generalizability study revealed Phi-coefficients of 0.60, 0.71, 0.76, and 0.79 for one to four occasions with ten items, respectively. One occasion requires twenty-five items for acceptable reliability (Phi-coefficient = 0.72). (4) Relations to other variables: Inter-rater reliability between self-marking and teacher is excellent for each item (rs(186) = 0.87-0.98,p = 0.001). (5) Consequences: Path analysis revealed that the self-reflected understanding score in the second attempt directly affected the final MCQ score (beta = 0.25,p = 0.033). However, the VSAQ score did not. Regarding perceptions, over 80% of students strongly agreed/agreed that the ISF-VSAQ format enhances problem analysis, presents realistic scenarios, develops knowledge, offers feedback, and supports electronic usability.Conclusion Electronic ISF-VSAQs enhanced understanding elevates learning outcomes, rendering them suitable for formative assessments with clinical scenarios. Increasing the number of occasions effectively enhances reliability. While self-marking is reliable and may reduce grading efforts, instructors should review answers to identify common student errors.
引用
收藏
页数:13
相关论文
共 42 条
[1]   Use of Generalizability Theory for Exploring Reliability of and Sources of Variance in Assessment of Technical Skills: A Systematic Review and Meta-Analysis [J].
Andersen, Steven Arild Wuyts ;
Nayahangan, Leizl Joy ;
Park, Yoon Soo ;
Konge, Lars .
ACADEMIC MEDICINE, 2021, 96 (11) :1609-1619
[2]  
ARNOLD L, 1985, J MED EDUC, V60, P21
[3]   Twelve tips for introducing very short answer questions (VSAQs) into your medical curriculum [J].
Bala, Laksha ;
Westacott, Rachel J. ;
Brown, Celia ;
Sam, Amir H. .
MEDICAL TEACHER, 2023, 45 (04) :360-367
[4]   Generalizability theory for the perplexed: A practical introduction and guide: AMEE Guide No. 68 [J].
Bloch, Ralph ;
Norman, Geoffrey .
MEDICAL TEACHER, 2012, 34 (11) :960-992
[5]   Generalizability Theory and Classical Test Theory [J].
Brennan, Robert L. .
APPLIED MEASUREMENT IN EDUCATION, 2011, 24 (01) :1-21
[6]   Generalizability theory: A practical guide to study design, implementation, and interpretation [J].
Briesch, Amy M. ;
Swaminathan, Hariharan ;
Welsh, Megan ;
Chafouleas, Sandra M. .
JOURNAL OF SCHOOL PSYCHOLOGY, 2014, 52 (01) :13-35
[7]   Feedback in the clinical setting [J].
Burgess, Annette ;
van Diggele, Christie ;
Roberts, Chris ;
Mellis, Craig .
BMC MEDICAL EDUCATION, 2020, 20 (Suppl 2)
[8]   Preparing and Presenting Validation Studies A Guide for the Perplexed [J].
Calhoun, Aaron W. ;
Scerbo, Mark W. .
SIMULATION IN HEALTHCARE-JOURNAL OF THE SOCIETY FOR SIMULATION IN HEALTHCARE, 2022, 17 (06) :357-365
[9]   A Review of the EDUG Software for Generalizability Analysis [J].
Clauser, Brian E. .
INTERNATIONAL JOURNAL OF TESTING, 2008, 8 (03) :296-301
[10]   Evaluating Statistical Targets for Assembling Parallel Mixed-Format Test Forms [J].
Debeer, Dries ;
Ali, Usama S. ;
van Rijn, Peter W. .
JOURNAL OF EDUCATIONAL MEASUREMENT, 2017, 54 (02) :218-242