Data Mining Approach to Predict Success of Secondary School Students: A Saudi Arabian Case Study

被引:22
作者
Alghamdi, Amnah Saeed [1 ]
Rahman, Atta [1 ]
机构
[1] Imam Abdulrahman Bin Faisal Univ, Coll Comp Sci & Informat Technol, Dept Comp Sci, Dammam 31441, Saudi Arabia
关键词
machine learning; educational data mining; secondary school; prediction; academic performance;
D O I
10.3390/educsci13030293
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
A problem that pervades throughout students' careers is their poor performance in high school. Predicting students' academic performance helps educational institutions in many ways. Knowing and identifying the factors that can affect the academic performance of students at the beginning of the thread can help educational institutions achieve their educational goals by providing support to students earlier. The aim of this study was to predict the achievement of early secondary students. Two sets of data were used for high school students who graduated from the Al-Baha region in the Kingdom of Saudi Arabia. In this study, three models were constructed using different algorithms: Naive Bayes (NB), Random Forest (RF), and J48. Moreover, the Synthetic Minority Oversampling Technique (SMOTE) technique was applied to balance the data and extract features using the correlation coefficient. The performance of the prediction models has also been validated using 10-fold cross-validation and direct partition in addition to various performance evaluation metrics: accuracy curve, true positive (TP) rate, false positive (FP) rate, accuracy, recall, F-Measurement, and receiver operating characteristic (ROC) curve. The NB model achieved a prediction accuracy of 99.34%, followed by the RF model with 98.7%.
引用
收藏
页数:24
相关论文
共 50 条
[21]   Environmental awareness level of secondary school students: A case study in Balikesir (Turkiye) [J].
Altin, Ahmet ;
Tecer, Selcen ;
Tecer, Lokman ;
Altin, Sureyya ;
Kahraman, Bekir Fatih .
4TH WORLD CONFERENCE ON LEARNING TEACHING AND EDUCATIONAL LEADERSHIP (WCLTA-2013), 2014, 141 :1208-1214
[22]   Data Mining Approach to Predict Air Pollution in Makassar [J].
Aini, Nurul ;
Mustafa, M. Syukri .
PROCEEDINGS OF ICORIS 2020: 2020 THE 2ND INTERNATIONAL CONFERENCE ON CYBERNETICS AND INTELLIGENT SYSTEM (ICORIS), 2020, :73-77
[23]   A data-mining approach to predict influent quality [J].
Kusiak, Andrew ;
Verma, Anoop ;
Wei, Xiupeng .
ENVIRONMENTAL MONITORING AND ASSESSMENT, 2013, 185 (03) :2197-2210
[24]   A data-mining approach to predict influent quality [J].
Andrew Kusiak ;
Anoop Verma ;
Xiupeng Wei .
Environmental Monitoring and Assessment, 2013, 185 :2197-2210
[25]   Classification and Prediction based Data Mining Algorithms to Predict Students' Introductory programming Performance [J].
Sivasakthi, M. .
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTING AND INFORMATICS (ICICI 2017), 2017, :346-350
[26]   Prevalence of smoking among male secondary school students in Jeddah, Saudi Arabia [J].
Fida, Hashim R. ;
Abdelmoneim, Ismail .
JOURNAL OF FAMILY AND COMMUNITY MEDICINE, 2013, 20 (03) :168-172
[27]   Apply of Sum of Difference Method to Predict Placement of Students' Using Educational Data Mining [J].
Ramanathan, L. ;
Geetha, Angelina ;
Khalid, M. ;
Swarnalatha, P. .
INFORMATION SYSTEMS DESIGN AND INTELLIGENT APPLICATIONS, VOL 1, INDIA 2016, 2016, 433 :367-377
[28]   Predicting dropout at master level using educational data mining: A case of public health students in saudi arabia [J].
Alhamad, Ibrahim Abdullah ;
Singh, Harman Preet .
AMAZONIA INVESTIGA, 2024, 13 (74) :264-275
[29]   Using Data Mining in Educational Administration: A Case Study on Improving School Attendance [J].
Moodley, Raymond ;
Chiclana, Francisco ;
Carter, Jenny ;
Caraffini, Fabio .
APPLIED SCIENCES-BASEL, 2020, 10 (09)
[30]   Role of Secondary Attributes to Boost the Prediction Accuracy of Students' Employability Via Data Mining [J].
Thakar, Pooja ;
Mehta, Anil ;
Manisha .
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2015, 6 (11) :84-90