Mining educational data to predict students performance A comparative study of data mining techniques

被引:28
作者
Nahar, Khaledun [1 ]
Shova, Boishakhe Islam [1 ]
Ria, Tahmina [1 ]
Rashid, Humayara Binte [1 ]
Islam, A. H. M. Saiful [1 ]
机构
[1] Notre Dame Univ Bangladesh, Dept CSE, 2-A Arambagh, Dhaka 1000, Bangladesh
关键词
Data mining techniques; Ensemble learning; Model building; Prediction;
D O I
10.1007/s10639-021-10575-3
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
Information is everywhere in a hidden and scattered way. It becomes useful when we apply Data mining to extracts the hidden, meaningful, and potentially useful patterns from these vast data resources. Educational data mining ensures a quality education by analyzing educational data based on various aspects. In this paper, we have analyzed the academic results and behavior of some engineering students. For this study, we collect data from 80 students from the CSE department. We gather data from mark sheets and other relevant factors that accelerate the results, collected through a survey. Our main goal is to predict the students' performance. According to this prediction, the counseling department will guide them in advance so that those who are likely to have bad results can do better. The classification can be based on various aspects, as many factors improve the educational system. We have created two datasets focusing on two different angles. Our first dataset classifies and predicts the category of a student (good, bad, medium) on a specific course based on their prerequisite course performance. We have implemented this in the artificial intelligence course. Our second dataset also classifies and predicts the final grade (A, B, C) of any random subject, here we organize our data such a way where it will only focus on how their performance was till the midterm exam. We analyze and compare six classification algorithms. We have focused on all aspects of an algorithm, not only the accuracy level but also the complexity and cost. We have built two final models for two of our datasets based on a decision tree and the naive Bayes algorithms accordingly.
引用
收藏
页码:6051 / 6067
页数:17
相关论文
共 16 条
[1]  
Ahmed S, 2014, 2014 17TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY (ICCIT), P314, DOI 10.1109/ICCITechn.2014.7073107
[2]  
Alasadi S. A., 2017, J ENG APPL SCI, V12, DOI DOI 10.3923/JEASCI.2017.4102.4107
[3]  
[Anonymous], 2011, Int. J. Comput. Appl.
[4]  
[Anonymous], 2017, 2 INT C ELECT ELECT
[5]  
Bhardwaj B.K., 2012, ARXIV PREPRINT ARXIV
[6]  
Bhargavi P, 2009, INT J COMPUT SCI NET, V9, P117
[7]   Dynamics of projective adaptive resonance theory model: The foundation of PART algorithm [J].
Cao, YQ ;
Wu, JH .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2004, 15 (02) :245-260
[8]   Predicting Academic Performance of Students Using a Hybrid Data Mining Approach [J].
Francis, Bindhia K. ;
Babu, Suvanam Sasidhar .
JOURNAL OF MEDICAL SYSTEMS, 2019, 43 (06)
[9]  
GarcYa S., 2015, FRANCISCO HERRERA DA
[10]  
Hussain S, 2018, INDONESIAN J ELECT E, V9, P447, DOI [10.11591/ijeecs.v9.i2.pp447-459, DOI 10.11591/IJEECS.V9.I2.PP447-459]