Research on learning behavior patterns from the perspective of educational data mining: Evaluation, prediction and visualization

被引:18
作者
Feng, Guiyun [1 ]
Fan, Muwei [1 ]
机构
[1] Guizhou Univ, Sch Management, Guiyang 550025, Peoples R China
关键词
Educational data mining; Learning behavior patterns; Evaluation methodologies; Classification algorithms; STUDENT PERFORMANCE; MODEL; LOGITBOOST;
D O I
10.1016/j.eswa.2023.121555
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The rapid growth of educational data creates the requirement to mine useful information from learning behavior patterns. The development of data mining technology makes educational data mining possible. The paper intends to use a public educational data set to study learning behavior patterns from the perspective of educational data mining, so as to promote the innovation of educational management. Firstly, in order to reduce the dimension of data analysis that facilitates the improvement in efficiency, principal component analysis is carried out to reduce the number of attributes in the data set. The significant attributes in the rotating principal component matrix rather than principal components which are not closely related to learning behavior patterns are extracted as the research variables. Then, a pseudo statistic is proposed to determine the number of clusters and the preprocessed data set is clustered according to the extracted attributes. The clustering results are applied to add class labels to the data, which is convenient for the later data training. Finally, six classification algorithms J48, K-Nearest Neighbor, Bayes Net, Random Forest, Support Vector Machine and Logit Boost are used to train the data with labels and build prediction models. At the same time, the performance and applicable conditions of six classifiers in terms of accuracy, efficiency, error, and so on are discussed and compared. It is found that the performance of the integrated algorithm is better than that of a single classifier. In the integrated algorithm, compared with Random Forest, the running time of Logit Boost is shorter.
引用
收藏
页数:11
相关论文
共 45 条
[11]   Analysis of Students' Behavior Through User Clustering in Online Learning Settings, Based on Self Organizing Maps Neural Networks [J].
Delgado, Soledad ;
Moran, Federico ;
San Jose, Jose Carlos ;
Burgos, Daniel .
IEEE ACCESS, 2021, 9 :132592-132608
[12]   On the short-term predictability of stock returns: A quantile boosting approach [J].
Demirer, Riza ;
Pierdzioch, Christian ;
Zhang, Huacheng .
FINANCE RESEARCH LETTERS, 2017, 22 :35-41
[13]   Exploration and Visualization of Learning Behavior Patterns From the Perspective of Educational Process Mining [J].
Feng, Guiyun ;
Fan, Muwei ;
Ao, Chuchu .
IEEE ACCESS, 2022, 10 :65271-65283
[14]   Analysis and Prediction of Students' Academic Performance Based on Educational Data Mining [J].
Feng, Guiyun ;
Fan, Muwei ;
Chen, Yu .
IEEE ACCESS, 2022, 10 :19558-19571
[15]   A decision-theoretic generalization of on-line learning and an application to boosting [J].
Freund, Y ;
Schapire, RE .
JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1997, 55 (01) :119-139
[16]   Additive logistic regression: A statistical view of boosting - Rejoinder [J].
Friedman, J ;
Hastie, T ;
Tibshirani, R .
ANNALS OF STATISTICS, 2000, 28 (02) :400-407
[17]   Deep cognitive diagnosis model for predicting students' performance [J].
Gao, Lina ;
Zhao, Zhongying ;
Li, Chao ;
Zhao, Jianli ;
Zeng, Qingtian .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2022, 126 :252-262
[18]   LogitBoost autoregressive networks [J].
Goessling, Marc .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2017, 112 :88-98
[19]  
Goldberger J., 2004, Advances in Neural Information Processing Systems 17, V17, P513
[20]   Predicting student academic performance in an engineering dynamics course: A comparison of four types of predictive mathematical models [J].
Huang, Shaobo ;
Fang, Ning .
COMPUTERS & EDUCATION, 2013, 61 :133-145