Student Performance Prediction with Decision Tree Ensembles and Feature Selection Techniques

被引:0
作者
Ahmad, Amir [1 ]
Ray, Santosh [2 ]
Khan, Md. Tabrej [3 ]
Nawaz, Ali [1 ]
机构
[1] United Arab Emirates Univ, Coll Informat Technol, Al Ain, U Arab Emirates
[2] Liwa Coll, Fac Informat Technol, Abu Dhabi, U Arab Emirates
[3] Pacific Acad Higher Educ & Res Univ, Fac Comp Sci, Udaipur, Rajasthan, India
关键词
Student dropout prediction; classification; ensembles; decision trees; imbalanced class; feature selection; CLASSIFICATION; PROJECTION; SMOTE;
D O I
10.1142/S0219649225500169
中图分类号
G25 [图书馆学、图书馆事业]; G35 [情报学、情报工作];
学科分类号
1205 ; 120501 ;
摘要
The prevalence of student dropout in academic settings is a serious issue that affects individuals and society as a whole. Timely intervention and support can be provided to such students if we get an accurate prediction of student performance. However, class imbalance and data complexity in education data are major challenges for traditional predictive analytics. Our research focusses on utilising machine learning techniques to predict student performance while handling imbalanced datasets. To address the imbalanced class problem, we employed both oversampling and undersampling techniques in our decision tree ensemble methods for the risk classification of prospective students. The effectiveness of classifiers was evaluated by varying the sizes of the ensembles and the oversampling and undersampling ratios. Additionally, we conducted experiments to integrate the feature selection processes with the best ensemble classifiers to further enhance the prediction. Based on the extensive experimentation, we concluded that ensemble methods such as Random Forest, Bagging, and Random Undersampling Boosting perform well in terms of performance measures such as Recall, Precision, F1-score, Area Under the Receiver Operating Characteristic Curve, and Geometric Mean. The F1-score of 0.849 produced by the Random Undersampling Boost classifier in conjunction with the Least Absolute Shrinkage and Selection Operator feature selection method indicates that this ensemble produces the best results.
引用
收藏
页数:21
相关论文
共 50 条
  • [21] Greedy Algorithm for Deriving Decision Rules from Decision Tree Ensembles
    Tetteh, Evans Teiko
    Zielosko, Beata
    [J]. ENTROPY, 2025, 27 (01)
  • [22] Stable feature selection for clinical prediction: Exploiting ICD tree structure using Tree-Lasso
    Kamkar, Iman
    Gupta, Sunil Kumar
    Dinh Phung
    Venkatesh, Svetha
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2015, 53 : 277 - 290
  • [23] The Impact of Feature Selection on Defect Prediction Performance: An Empirical Comparison
    Xu, Zhou
    Liu, Jin
    Yang, Zijiang
    An, Gege
    Jia, Xiangyang
    [J]. 2016 IEEE 27TH INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING (ISSRE), 2016, : 309 - 320
  • [24] A feature selection algorithm of decision tree based on feature weight
    Zhou, HongFang
    Zhang, JiaWei
    Zhou, YueQing
    Guo, XiaoJie
    Ma, YiMing
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2021, 164
  • [25] Speech emotion recognition based on feature selection and extreme learning machine decision tree
    Liu, Zhen-Tao
    Wu, Min
    Cao, Wei-Hua
    Mao, Jun-Wei
    Xu, Jian-Ping
    Tan, Guan-Zheng
    [J]. NEUROCOMPUTING, 2018, 273 : 271 - 280
  • [26] On developing an automatic threshold applied to feature selection ensembles
    Seijo-Pardo, B.
    Bolon-Canedo, V
    Alonso-Betanzos, A.
    [J]. INFORMATION FUSION, 2019, 45 : 227 - 245
  • [27] A comparative study of combining tree-based feature selection methods and classifiers in personal loan default prediction
    Guo, Weidong
    Zhou, Zach Zhizhong
    [J]. JOURNAL OF FORECASTING, 2022, 41 (06) : 1248 - 1313
  • [28] Empirical validation of feature selection techniques for cross-project defect prediction
    Malhotra, Ruchika
    Meena, Shweta
    [J]. INTERNATIONAL JOURNAL OF SYSTEM ASSURANCE ENGINEERING AND MANAGEMENT, 2024, 15 (05) : 1743 - 1755
  • [29] On Combining Feature Selection and Over-Sampling Techniques for Breast Cancer Prediction
    Huang, Min-Wei
    Chiu, Chien-Hung
    Tsai, Chih-Fong
    Lin, Wei-Chao
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (14):
  • [30] Software-based Prediction of Liver Disease with Feature Selection and Classification Techniques
    Singh, Jagdeep
    Bagga, Sachin
    Kaur, Ranjodh
    [J]. INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND DATA SCIENCE, 2020, 167 : 1970 - 1980