Systematic ensemble model selection approach for educational data mining

Cited by: 75
Authors
Injadat, MohammadNoor [1 ]
Moubayed, Abdallah [1 ]
Nassif, Ali Bou [1 ,2 ]
Shami, Abdallah [1 ]
Affiliations
[1] Univ Western Ontario, Elect & Comp Engn Dept, London, ON, Canada
[2] Univ Sharjah, Comp Engn Dept, Sharjah, U Arab Emirates
Keywords
e-learning; Student performance prediction; Educational data mining; Ensemble learning model selection; Gini index; p-value; PREDICTING ACADEMIC-PERFORMANCE
DOI
10.1016/j.knosys.2020.105992
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Much prior research has focused on predicting students' performance in order to support their development. Many institutions aim to improve performance and education quality, which can be achieved by using data mining techniques to analyze and predict students' performance and to identify factors that may affect their final marks. To address this issue, this work starts by thoroughly exploring and analyzing two different datasets at two separate stages of course delivery (20% and 50%, respectively) using multiple graphical, statistical, and quantitative techniques. The feature analysis provides insights into the nature of the different features considered and helps in the choice of the machine learning algorithms and their parameters. Furthermore, this work proposes a systematic approach based on the Gini index and p-value to select a suitable ensemble learner from a combination of six potential machine learning algorithms. Experimental results show that the proposed ensemble models achieve high accuracy and a low false positive rate at all stages for both datasets. (C) 2020 Elsevier B.V. All rights reserved.
Pages: 16
Related papers
50 in total
  • [11] Educational Data Mining Model Using Rattle
    Hussain, Sadiq
    Hazarika, G. C.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2014, 5 (06) : 22 - 27
  • [12] Educational data mining: A review
    Mohamad, Siti Khadijah
    Tasir, Zaidatun
    9TH INTERNATIONAL CONFERENCE ON COGNITIVE SCIENCE, 2013, 97 : 320 - 324
  • [13] Performance Analysis of Feature Selection Algorithm for Educational Data Mining
    Zaffar, Maryam
    Hashmani, Manzoor Ahmed
    Savita, K. S.
    2017 IEEE CONFERENCE ON BIG DATA AND ANALYTICS (ICBDA), 2017 : 7 - 12
  • [14] An Intelligent Prediction System for Educational Data Mining Based on Ensemble and Filtering approaches
    Ashraf, Mudasir
    Zaman, Majid
    Ahmed, Muheet
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND DATA SCIENCE, 2020, 167 : 1471 - 1483
  • [15] Educational data mining: a systematic review of research and emerging trends
    Du, Xu
    Yang, Juan
    Hung, Jui-Long
    Shelton, Brett
    INFORMATION DISCOVERY AND DELIVERY, 2020, 48 (04) : 225 - 236
  • [16] Educational Data Mining: A Mining Model for Developing Students' Programming Skills
    Pathan, Asraful Alam
    Hasan, Mehedi
    Ahmed, Md. Ferdous
    Farid, Dewan Md.
    8TH INTERNATIONAL CONFERENCE ON SOFTWARE, KNOWLEDGE, INFORMATION MANAGEMENT AND APPLICATIONS (SKIMA 2014), 2014
  • [17] Educational Data Mining Based on Multi-objective Weighted Voting Ensemble Classifier
    Abdar, Moloud
    Yen, Neil Y.
    Hung, Jason C.
    PROCEEDINGS 2017 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI), 2017, : 357 - 362
  • [18] ACADEMIC ANALYTICS AND EDUCATIONAL DATA MINING AT THE UNIVERSITY LEVEL: A SYSTEMATIC REVIEW
    Chavarry Chankay, Mariana
    Aquino Trujillo, Jury Yesenia
    Li Vega, Fiorella Vanessa
    German Reyes, Nilton Cesar
    REVISTA UNIVERSIDAD Y SOCIEDAD, 2022, 14 : 377 - 390
  • [19] Self-Regulated Learning Model in Educational Data Mining
    Nuankaewo, Pratya
    INTERNATIONAL JOURNAL OF EMERGING TECHNOLOGIES IN LEARNING, 2022, 17 (17) : 4 - 27
  • [20] Ordinal regression by a gravitational model in the field of educational data mining
    Gomez-Rey, Pilar
    Fernandez-Navarro, Francisco
    Barbera, Elena
    EXPERT SYSTEMS, 2016, 33 (02) : 161 - 175