Empowering education: Harnessing ensemble machine learning approach and ACO-DT classifier for early student academic performance prediction

被引:0
|
作者
Mahawar, Kajal [1 ]
Rattan, Punam [1 ]
机构
[1] Lovely Profess Univ, Sch Comp Applicat, Phagwara, Punjab, India
关键词
Machine learning; Students' performance; Multivariate ensemble model; ML classifiers; Feature selection; Ant Colony Optimization;
D O I
10.1007/s10639-024-12976-6
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
Higher education institutions have consistently strived to provide students with top-notch education. To achieve better outcomes, machine learning (ML) algorithms greatly simplify the prediction process. ML can be utilized by academicians to obtain insight into student data and mine data for forecasting the performance. In this paper, the authors proposed an ML-based student prediction model based on the demographic, social, psychological, and economic factors, collectively. The dataset utilized for this study was compiled from a designed questionnaire administered to second-year undergraduate students. The objective of this study is to uncover factors that could assist in predicting students' performance. Eight ML classifiers, logistic regression, random forest, support vector machine, XGBoost, support vector machine with a linear kernel, na & iuml;ve Bayes, K-Nearest Neighbor, and decision tree are used to forecast student performance. Additionally, nine feature selection techniques, variance threshold, XGBoost, feature importance, recursive feature elimination, chi-square, ridge, Pearson correlation, lasso, and random forest, are employed to determine optimal factors. The authors experimented with each technique by creating two sets of training and testing data with 80:20 and 70:30 proportions, respectively. Comparatively, the ensemble DXK (DT + XGB + KNN) model with cross-validation and 80:20 proportions outperformed other standard classifiers, achieving a highest accuracy of 97.83%, an r-square of 96.17%, a precision of 97.94%, a recall of 97.83%, and an f1-score of 97.88%. These were the highest among all models tested. Additionally, the authors propose the ACO-DT model, which improves the prediction performance of the top-performing DT classifier by utilizing the Ant Colony Optimization technique. The findings demonstrate that the proposed model with 80:20 proportions achieve an accuracy of 98.15%, an f1-score of 98.16%, a precision of 98.18%, a recall of 98.15%, and an r-square of 84.75%, surpassing all other models for forecasting student performance. Using the specified data size, this model creation time is 8.49 s. The authors also recommended the future research directions to further enhance this study.
引用
收藏
页码:4639 / 4667
页数:29
相关论文
共 30 条
  • [1] CatBoost - An Ensemble Machine Learning Model for Prediction and Classification of Student Academic Performance
    Joshi, Abhisht
    Saggar, Pranay
    Jain, Rajat
    Sharma, Moolchand
    Gupta, Deepak
    Khanna, Ashish
    ADVANCES IN DATA SCIENCE AND ADAPTIVE ANALYSIS, 2021, 13 (03N04)
  • [2] A Machine Learning Ensemble Classifier for Early Prediction of Diabetic Retinopathy
    Somasundaram, S. K.
    Alli, P.
    JOURNAL OF MEDICAL SYSTEMS, 2017, 41 (12)
  • [3] A Machine Learning Ensemble Classifier for Early Prediction of Diabetic Retinopathy
    Somasundaram S K
    Alli P
    Journal of Medical Systems, 2017, 41
  • [4] Harnessing Ensemble in Machine Learning for Accurate Early Prediction and Prevention of Heart Disease
    Husain, Mohammad
    Kumar, Pankaj
    Ahmed, Mohammad Nadeem
    Ali, Arshad
    Rasool, Mohammad Ashiquee
    Hussain, Mohammad Rashid
    Dildar, Muhammad Shahid
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (10) : 182 - 195
  • [5] Study on Feature Engineering and Ensemble Learning for Student Academic Performance Prediction
    Du Xiaoming
    Chen Ying
    Zhang Xiaofang
    Guo Yu
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (05) : 495 - 502
  • [6] A Prediction Model for Student Academic Performance Using Machine Learning
    Kaur H.
    Kaur T.
    Garg R.
    Informatica (Slovenia), 2023, 47 (01): : 97 - 108
  • [7] Machine learning approach to student performance prediction of online learning
    Wang, Jing
    Yu, Yun
    PLOS ONE, 2025, 20 (01):
  • [8] Predicting student specializations: a Machine Learning Approach based on Academic Performance
    Angeioplastis, Athanasios
    Papaioannou, Nikolaos
    Tsimpiris, Alkiviadis
    Kamilali, Angeliki
    Varsamis, Dimitrios
    JOURNAL OF E-LEARNING AND KNOWLEDGE SOCIETY, 2024, 20 (02): : 19 - 27
  • [9] Contributions of Machine Learning Models towards Student Academic Performance Prediction: A Systematic Review
    Balaji, Prasanalakshmi
    Alelyani, Salem
    Qahmash, Ayman
    Mohana, Mohamed
    APPLIED SCIENCES-BASEL, 2021, 11 (21):
  • [10] Student academic success prediction in multimedia-supported virtual learning system using ensemble learning approach
    Saidani O.
    Umer M.
    Alshardan A.
    Alturki N.
    Nappi M.
    Ashraf I.
    Multimedia Tools and Applications, 2024, 83 (40) : 87553 - 87578