Enhanced binary genetic algorithm as a feature selection to predict student performance

被引:0
作者
Salam Salameh Shreem
Hamza Turabieh
Sana Al Azwari
Faiz Baothman
机构
[1] HLT,Data Science Department
[2] Taif University,Information Technology Department, Collage of Computing and Information Technology
[3] Taif University,Computer Science Department, Collage of Computing and Information Technology
来源
Soft Computing | 2022年 / 26卷
关键词
Genetic algorithm; Feature selection; Students performance; Electromagnetic-like mechanism;
D O I
暂无
中图分类号
学科分类号
摘要
Students’ performance prediction systems play a vital role in enhancing the educational performance inside universities, schools, and training centers. Big data can come from different resources such as examination centers, virtual courses, registration departments, e-learning systems. Extracting meaningful knowledge from educational data is a complex task, so reducing the data dimensionality is needed. In this paper, we proposed an enhanced binary genetic algorithm (EBGA) as a wrapper feature selection algorithm. Novel hybrid selection mechanism based on a k-means algorithm and electromagnetic-like mechanism (EM) method is proposed. K-means will cluster the population into a set of clusters, while EM will determine a value called a total force (TF) for each solution. Each cluster has an accumulated total force (ATF) (i.e., adding all TFs together). Selection process will select two solutions with the highest TF from the cluster, which has the highest ATF. We employed a hybrid machine learning approach between the proposed EBGA and five different classifiers (i.e., k-Nearest Neighbors (k-NN), Decision Trees (DT), Naive Bayes (NB), Support Vector Machine (SVM), and Linear Discriminant Analysis (LDA)). Two real case studies obtained from UCI Machine Learning Repository are used in this paper. Obtained results showed the ability of the proposed approach to enhance the performance of the binary genetic algorithm. Moreover, the performances of all classifiers are improved between 1%\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$1\%$$\end{document} and 11%\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$11\%$$\end{document}.
引用
收藏
页码:1811 / 1823
页数:12
相关论文
共 113 条
  • [1] Abdullah Z(2011)Mining significant association rules from educational data using critical relative support approach Procedia Soc Behav Sci 7 31883-31902
  • [2] Herawan T(2019)Survey of state-of-the-art mixed data clustering algorithms IEEE Access 37 13-49
  • [3] Ahmad N(2019)Educational data mining and learning analytics for 21st century higher education: a review and synthesis Telemat Inf 1 3-17
  • [4] Deris MM(2009)The state of educational data mining in 2009: a review and future visions J Edu Data Min 25 263-282
  • [5] Ahmad A(2003)An electromagnetism-like mechanism for global optimization J Global Optim 42 5508-5521
  • [6] Khan SS(2015)Data mining models for student careers Exp Syst Appl 16 313-320
  • [7] Aldowah H(2004)Auc optimization vs. error rate minimization Adv Neural Inf Process Syst 24 2429-2452
  • [8] Al-Samarraie H(2019)Machine learning model to predict an adult learner’s decision to continue esol course Educ Inf Technol 79 909-919
  • [9] Fauzy WM(2018)Iot-based students interaction framework using attention-scoring assessment in elearning Future Gener Comput Syst 94 335-343
  • [10] Baker RS(2019)Educational data mining: predictive analysis of academic performance of public school students in the capital of Brazil J Bus Res 43 162-88