Data Mining Approach to Predict Success of Secondary School Students: A Saudi Arabian Case Study

被引:22
作者
Alghamdi, Amnah Saeed [1 ]
Rahman, Atta [1 ]
机构
[1] Imam Abdulrahman Bin Faisal Univ, Coll Comp Sci & Informat Technol, Dept Comp Sci, Dammam 31441, Saudi Arabia
关键词
machine learning; educational data mining; secondary school; prediction; academic performance;
D O I
10.3390/educsci13030293
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
A problem that pervades throughout students' careers is their poor performance in high school. Predicting students' academic performance helps educational institutions in many ways. Knowing and identifying the factors that can affect the academic performance of students at the beginning of the thread can help educational institutions achieve their educational goals by providing support to students earlier. The aim of this study was to predict the achievement of early secondary students. Two sets of data were used for high school students who graduated from the Al-Baha region in the Kingdom of Saudi Arabia. In this study, three models were constructed using different algorithms: Naive Bayes (NB), Random Forest (RF), and J48. Moreover, the Synthetic Minority Oversampling Technique (SMOTE) technique was applied to balance the data and extract features using the correlation coefficient. The performance of the prediction models has also been validated using 10-fold cross-validation and direct partition in addition to various performance evaluation metrics: accuracy curve, true positive (TP) rate, false positive (FP) rate, accuracy, recall, F-Measurement, and receiver operating characteristic (ROC) curve. The NB model achieved a prediction accuracy of 99.34%, followed by the RF model with 98.7%.
引用
收藏
页数:24
相关论文
共 53 条
[1]   Data mining approach to predicting the performance of first year student in a university using the admission requirements [J].
Adekitan, Aderibigbe Israel ;
Noma-Osaghae, Etinosa .
EDUCATION AND INFORMATION TECHNOLOGIES, 2019, 24 (02) :1527-1543
[2]  
Aggarwal V.B., 2015, ADV INTELLIGENT SYST
[3]   A Real-Time Computer Vision Based Approach to Detection and Classification of Traffic Incidents [J].
Ahmed, Mohammed Imran Basheer ;
Zaghdoud, Rim ;
Ahmed, Mohammed Salih ;
Sendi, Razan ;
Alsharif, Sarah ;
Alabdulkarim, Jomana ;
Saad, Bashayr Adnan Albin ;
Alsabt, Reema ;
Rahman, Atta ;
Krishnasamy, Gomathi .
BIG DATA AND COGNITIVE COMPUTING, 2023, 7 (01)
[4]   Using Word Embedding and Ensemble Learning for Highly Imbalanced Data Sentiment Analysis in Short Arabic Text [J].
Al-Azani, Sadam ;
El-Alfy, El-Sayed M. .
8TH INTERNATIONAL CONFERENCE ON AMBIENT SYSTEMS, NETWORKS AND TECHNOLOGIES (ANT-2017) AND THE 7TH INTERNATIONAL CONFERENCE ON SUSTAINABLE ENERGY INFORMATION TECHNOLOGY (SEIT 2017), 2017, 109 :359-366
[5]  
Alhassan A.M., 2020, THESIS ABDULAZIZ U J
[6]   An enhanced J48 classification algorithm for the anomaly intrusion detection systems [J].
Aljawarneh, Shadi ;
Yassein, Muneer Bani ;
Aljundi, Mohammed .
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (Suppl 5) :10549-10565
[7]   Arabic Tweets-Based Sentiment Analysis to Investigate the Impact of COVID-19 in KSA: A Deep Learning Approach [J].
Alqarni, Arwa ;
Rahman, Atta .
BIG DATA AND COGNITIVE COMPUTING, 2023, 7 (01)
[8]  
Alyahyan E, 2020, 2020 2 INT C COMP IN, DOI [10.1109/ICCIS49240.2020.9257646, DOI 10.1109/ICCIS49240.2020.9257646]
[9]  
[Anonymous], 2009, SIGKDD Explorations, DOI [10.1145/1656274.1656278, DOI 10.1145/1656274.1656278]
[10]  
[Anonymous], ED SAUDI ARABIA