Applying machine learning to automatically assess scientific models

Cited by: 45
Authors
Zhai, Xiaoming [1, 2]
He, Peng [3 ]
Krajcik, Joseph [3 ]
Affiliations
[1] Univ Georgia, Dept Math Sci & Social Studies Educ, 105J Aderhold Hall,110 Carlton St, Athens, GA 30602 USA
[2] Univ Georgia, Inst Artificial Intelligence, 105J Aderhold Hall,110 Carlton St, Athens, GA 30602 USA
[3] Michigan State Univ, CREATE STEM Inst, E Lansing, MI 48824 USA
Funding
U.S. National Science Foundation
Keywords
artificial intelligence; artificial neural networks; deep learning; inclusive assessment; machine learning; natural language processing; scientific model; STUDENTS; SCIENCE; REPRESENTATIONS; CHEMISTRY; KNOWLEDGE; THINKING; IMPACT; TEXT;
DOI
10.1002/tea.21773
Chinese Library Classification number
G40 [Education]
Discipline classification code
040101; 120403
Abstract
Involving students in scientific modeling practice is one of the most effective approaches to achieving the next generation science education learning goals. Given the complexity and multirepresentational features of scientific models, scoring student-developed models is time- and cost-intensive, remaining one of the most challenging assessment practices for science education. More importantly, teachers who rely on timely feedback to plan and adjust instruction are reluctant to use modeling tasks because they cannot provide timely feedback to learners. This study utilized machine learning (ML), the most advanced artificial intelligence (AI), to develop an approach to automatically score student-drawn models and their written descriptions of those models. We developed six modeling assessment tasks for middle school students that integrate disciplinary core ideas and crosscutting concepts with the modeling practice. For each task, we asked students to draw a model and write a description of that model, which gave students with diverse backgrounds an opportunity to represent their understanding in multiple ways. We then collected student responses to the six tasks and had human experts score a subset of those responses. We used the human-scored student responses to develop ML algorithmic models (AMs) and to train the computer. Validation using new data suggests that the machine-assigned scores achieved robust agreement with human consensus scores. Qualitative analysis of student-drawn models further revealed five characteristics that might impact machine scoring accuracy: alternative expression, confusing label, inconsistent size, inconsistent position, and redundant information. We argue that these five characteristics should be considered when developing machine-scorable modeling tasks.
Pages: 1765-1794
Number of pages: 30