Supervised machine learning algorithms for predicting student dropout and academic success: a comparative study

被引:3
|
作者
Villar A. [1 ]
de Andrade C.R.V. [2 ]
机构
[1] Faculty of Computer Science, University of Essex, Essex, Colchester
[2] Faculty of Applied Mathematics, University of Sao Paulo, Sao Paulo, Sao Paulo
来源
Discover Artificial Intelligence | 2024年 / 4卷 / 01期
关键词
Boosting algorithms; Class imbalance; Dropout prediction; Hyperparameters; Learning analytics; Supervised classification algorithms; Unsupervised classification algorithms;
D O I
10.1007/s44163-023-00079-z
中图分类号
学科分类号
摘要
Utilizing a dataset sourced from a higher education institution, this study aims to assess the efficacy of diverse machine learning algorithms in predicting student dropout and academic success. Our focus was on algorithms capable of effectively handling imbalanced data. To tackle class imbalance, we employed the SMOTE resampling technique. We applied a range of algorithms, including Decision Tree (DT), Support Vector Machine (SVM), Random Forest (RF), as well as boosting algorithms such as Gradient Boosting (GB), Extreme Gradient Boosting (XGBoost), CatBoost (CB), and Light Gradient Boosting Machine (LB). To enhance the models' performance, we conducted hyperparameter tuning using Optuna. Additionally, we employed the Isolation Forest (IF) method to identify outliers or anomalies within the dataset. Notably, our findings indicate that boosting algorithms, particularly LightGBM and CatBoost with Optuna, outperformed traditional classification methods. Our study's generalizability to other contexts is constrained due to its reliance on a single dataset, with inherent limitations. Nevertheless, this research provides valuable insights into the effectiveness of various machine learning algorithms for predicting student dropout and academic success. By benchmarking these algorithms, our project offers guidance to both researchers and practitioners in their choice of suitable approaches for similar predictive tasks. © The Author(s) 2024.
引用
收藏
相关论文
共 50 条
  • [1] Predicting Student Dropout and Academic Success
    Realinho, Valentim
    Machado, Jorge
    Baptista, Luis
    Martins, Monica V.
    DATA, 2022, 7 (11)
  • [2] A comparative study of machine learning and deep learning algorithms for predicting student's academic performance
    Bhushan, Megha
    Vyas, Satyam
    Mall, Shrey
    Negi, Arun
    INTERNATIONAL JOURNAL OF SYSTEM ASSURANCE ENGINEERING AND MANAGEMENT, 2023, 14 (06) : 2674 - 2683
  • [3] A comparative study of machine learning and deep learning algorithms for predicting student’s academic performance
    Megha Bhushan
    Satyam Vyas
    Shrey Mall
    Arun Negi
    International Journal of System Assurance Engineering and Management, 2023, 14 : 2674 - 2683
  • [4] Machine Learning Based Predicting Student Academic Success
    Al Mayahi, Khalfan
    Al-Bahri, Mahmood
    2020 12TH INTERNATIONAL CONGRESS ON ULTRA MODERN TELECOMMUNICATIONS AND CONTROL SYSTEMS AND WORKSHOPS (ICUMT 2020), 2020, : 264 - 268
  • [5] Analyzing Supervised Learning Models for Predicting Student Dropout and Success in Higher Education
    Bhurre, Shraddha
    Prajapat, Shaligram
    ADVANCES IN COMPUTATIONAL INTELLIGENCE SYSTEMS, UKCI 2023, 2024, 1453 : 234 - 248
  • [6] Predicting the percentage of student placement: A comparative study of machine learning algorithms
    Cakit, Erman
    Dagdeviren, Metin
    EDUCATION AND INFORMATION TECHNOLOGIES, 2022, 27 (01) : 997 - 1022
  • [7] Predicting the percentage of student placement: A comparative study of machine learning algorithms
    Erman Çakıt
    Metin Dağdeviren
    Education and Information Technologies, 2022, 27 : 997 - 1022
  • [8] Predicting Student Success Using Big Data and Machine Learning Algorithms
    Ouatik, Farouk
    Erritali, Mohammed
    Ouatik, Fahd
    Jourhmane, Mostafa
    INTERNATIONAL JOURNAL OF EMERGING TECHNOLOGIES IN LEARNING, 2022, 17 (12): : 236 - 251
  • [9] Comparative study of supervised learning algorithms for student performance prediction
    Mohammadi, Mehdi
    Dawodi, Mursal
    Tomohisa, Wada
    Ahmadi, Nadira
    2019 1ST INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE IN INFORMATION AND COMMUNICATION (ICAIIC 2019), 2019, : 124 - 127
  • [10] Predicting High School Students' Academic Performance: A Comparative Study of Supervised Machine Learning Techniques
    Sanchez-Pozo, Nadia N.
    Mejia-Ordonez, Juan S.
    Chamorro, Diana C.
    Mayorca-Torres, Dagoberto
    Peluffo-Ordonez, Diego H.
    2021 MACHINE LEARNING-DRIVEN DIGITAL TECHNOLOGIES FOR EDUCATIONAL INNOVATION WORKSHOP, 2021,