Explaining Individual and Collective Programming Students' Behavior by Interpreting a Black-Box Predictive Model

被引:30
作者
Pereira, Filipe Dwan [1 ]
Fonseca, Samuel C. [2 ]
Oliveira, Elaine H. T. [2 ]
Cristea, Alexandra, I [3 ]
Bellhauser, Henrik [4 ]
Rodrigues, Luiz [5 ]
Oliveira, David B. F. [2 ]
Isotani, Seiji [5 ]
Carvalho, Leandro S. G. [2 ]
机构
[1] Univ Fed Roraima, Dept Comp Sci, BR-69310000 Boa Vista, Parana, Brazil
[2] Univ Fed Amazonas, Inst Comp, BR-69067005 Manaus, Amazonas, Brazil
[3] Univ Durham, Dept Comp Sci, Durham DH1 3LE, England
[4] Johannes Gutenberg Univ Mainz, Dept Psychol, D-55122 Mainz, Germany
[5] Univ Sao Paulo, Inst Math & Comp Sci, BR-13566590 Sao Carlos, Brazil
关键词
Predictive models; Programming; Deep learning; Programming profession; Feature extraction; Analytical models; Computational modeling; Explainable artificial intelligence; online judges; learning analytics; CS1; computing in education; early prediction; shapley values; FORMATIVE ASSESSMENT; MOTIVATION;
D O I
10.1109/ACCESS.2021.3105956
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Predicting student performance as early as possible and analysing to which extent initial student behaviour could lead to failure or success is critical in introductory programming (CS1) courses, for allowing prompt intervention in a move towards alleviating their high failure rate. However, in CS1 performance prediction, there is a serious lack of studies that interpret the predictive model's decisions. In this sense, we designed a long-term study using very fine-grained log-data of 2056 students, collected from the first two weeks of CS1 courses. We extract features that measure how students deal with deadlines, how they fix errors, how much time they spend programming, and so forth. Subsequently, we construct a predictive model that achieved cutting-edge results with area under the curve (AUC) of.89, and an average accuracy of 81.3%. To allow an effective intervention and to facilitate human-AI collaboration towards prescriptive analytics, we, for the first time, to the best of our knowledge, go a step further than the prediction itself and leverage this field by proposing an approach to explaining our predictive model decisions individually and collectively using a game-theory based framework (SHAP), (Lundberg et al., 2020) that allows interpreting our black-box non-linear model linearly. In other words, we explain the feature effects, clearly by visualising and analysing individual predictions, the overall importance of features, and identification of typical prediction paths. This method can be further applied to other emerging competitive models, as the CS1 prediction field progresses, ensuring transparency of the process for key stakeholders: administrators, teachers, and students.
引用
收藏
页码:117097 / 117119
页数:23
相关论文
共 72 条
[1]  
Ahadi Alireza., 2016, Proceedings of the 2016 ACM Conference on Innovation and Technology in Computer Science Education. ITiCSE '16, P218, DOI DOI 10.1145/2899415.2899452
[2]  
[Anonymous], 2014, P 2014 C INN TECHN C
[3]  
[Anonymous], 2015, P 11 ANN INT C INT C, DOI DOI 10.1145/2787622.2787717
[4]  
[Anonymous], 2006, P 2 INT WORKSHOP COM, DOI 10.1145/1151588.1151600
[5]   On Pixel-Wise Explanations for Non-Linear Classifier Decisions by Layer-Wise Relevance Propagation [J].
Bach, Sebastian ;
Binder, Alexander ;
Montavon, Gregoire ;
Klauschen, Frederick ;
Mueller, Klaus-Robert ;
Samek, Wojciech .
PLOS ONE, 2015, 10 (07)
[6]  
Baker R.S., 2014, LEARNING ANAL, P61, DOI DOI 10.1007/978-1-4614-3305-7_4
[7]  
Batista G, 2004, ACM SIGKDD Explor Newsl, V6, P20, DOI DOI 10.1145/1007730.1007735
[8]   Applying a web-based training to foster self-regulated learning - Effects of an intervention for large numbers of participants [J].
Bellhaeuser, Henrik ;
Loesch, Thomas ;
Winter, Charlotte ;
Schmitz, Bernhard .
INTERNET AND HIGHER EDUCATION, 2016, 31 :87-100
[9]   AI in education: learner choice and fundamental rights [J].
Berendt, Bettina ;
Littlejohn, Allison ;
Blakemore, Mike .
LEARNING MEDIA AND TECHNOLOGY, 2020, 45 (03) :312-324
[10]  
Bez JL, 2014, INT CONF COMP SCI ED, P149, DOI 10.1109/ICCSE.2014.6926445