Predicting academic performance using tree-based machine learning models: A case study of bachelor students in an engineering department in China

Cited by: 0
Authors
Wei Zhang
Yu Wang
Suyu Wang
Affiliation
[1] South China Agricultural University, College of Water Conservancy and Civil Engineering
Source
Education and Information Technologies | 2022 / Vol. 27
Keywords
Educational data mining; Quality of teaching and learning; Classification; Decision tree; Engineering education; Department administration
DOI
Not available
Chinese Library Classification number
Subject classification number
Abstract
Educational data mining (EDM) provides valuable educational information by applying data mining tools and techniques to analyze data at educational institutions. In this paper, tree-based machine learning algorithms are used to predict students’ overall academic performance in their bachelor’s program. Transcript data of students from a single department at a Chinese university were collected. All courses in the bachelor’s program were then divided into six typical categories, and the mean GPA of each category was taken as a primary input feature for prediction. Three tree-based machine learning models were established: decision tree (DT), gradient boosting decision tree (GBDT), and random forest (RF). Results show that the RF model can identify more than 80% of the students at risk of low performance by the end of the second semester. This is meaningful because the department can improve its overall quality of teaching and learning by taking timely, targeted measures based on the model's predictions. Feature importance and the structure of the decision tree were also analyzed to extract knowledge that is valuable for both students and teachers. The results of this case study can be used as a reference for other engineering departments in China.
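The feature construction and the "more than 80% identified" figure described in the abstract can be illustrated with a minimal sketch. It is not the authors' code: the category labels, course names, GPA values, and the helper names `category_mean_gpas` and `recall` are all hypothetical, and reading the reported percentage as recall on the at-risk class is an assumption. The abstract states only that there are six course categories and does not name them.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical placeholders: the abstract gives six categories but not their names.
CATEGORIES = ["cat_1", "cat_2", "cat_3", "cat_4", "cat_5", "cat_6"]


def category_mean_gpas(transcript, course_category):
    """Return {category: mean GPA} for one student.

    transcript: list of (course_name, gpa) pairs.
    course_category: dict mapping course_name -> category label.
    Categories with no completed course are reported as None.
    """
    grades = defaultdict(list)
    for course, gpa in transcript:
        grades[course_category[course]].append(gpa)
    return {c: (mean(grades[c]) if grades[c] else None) for c in CATEGORIES}


def recall(y_true, y_pred, positive=1):
    """Share of truly positive (at-risk) students that the model flagged."""
    preds_for_positives = [p for t, p in zip(y_true, y_pred) if t == positive]
    if not preds_for_positives:
        return 0.0
    return sum(p == positive for p in preds_for_positives) / len(preds_for_positives)


# Made-up data for illustration only.
course_category = {"Calculus": "cat_1", "Physics": "cat_1", "English": "cat_2"}
transcript = [("Calculus", 3.0), ("Physics", 2.0), ("English", 4.0)]
features = category_mean_gpas(transcript, course_category)
```

The resulting per-category means would form one row of the input matrix for a tree-based classifier, and `recall` would then be computed on held-out students with the at-risk class as the positive label.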
Pages: 13051-13066
Number of pages: 15