Predicting academic performance using tree-based machine learning models: A case study of bachelor students in an engineering department in China

被引:0
作者
Wei Zhang
Yu Wang
Suyu Wang
机构
[1] South China Agricultural University,College of Water Conservancy and Civil Engineering
来源
Education and Information Technologies | 2022年 / 27卷
关键词
Educational data mining; Quality of teaching and learning; Classification; Decision tree; Engineering education; Department administration;
D O I
暂无
中图分类号
学科分类号
摘要
Educational data mining (DEM) provides valuable educational information by applying data mining tools and techniques to analyze data at educational institutions. In this paper, tree-based machine learning algorithms are used to predict students’ overall academic performance in their bachelor’s program. The transcript data of the students in the same department in a Chinese university were collected. All the courses in the bachelor’s program were then divided into six typical categories, and the mean GPAs of each category were taken as primary input features for prediction. Three tree-based machine learning models were established, i.e. decision tree (DT), Gradient boosting decision tree (GBDT) and random forest (RF). Results show that we can successfully identify more than 80% of the students at low-performance risk using the RF model at the end of the second semester, which is meaningful because the global quality of teaching and learning of the department can be improved by taking targeted measures in time according to the machine learning model. Feature importance and the structure of decision tree were also analyzed to extract knowledge that is valuable for both students and teachers. The results of this case study can be used as a reference for other engineering departments in China.
引用
收藏
页码:13051 / 13066
页数:15
相关论文
共 50 条
  • [31] Analysing the Determinants of Surface Solar Radiation with Tree-Based Machine Learning Methods: Case of Istanbul
    Guven, Denizhan
    PURE AND APPLIED GEOPHYSICS, 2024, 181 (05) : 1633 - 1659
  • [32] A comparative study of machine learning and deep learning algorithms for predicting student's academic performance
    Bhushan, Megha
    Vyas, Satyam
    Mall, Shrey
    Negi, Arun
    INTERNATIONAL JOURNAL OF SYSTEM ASSURANCE ENGINEERING AND MANAGEMENT, 2023, 14 (06) : 2674 - 2683
  • [33] A comparative study of machine learning and deep learning algorithms for predicting student’s academic performance
    Megha Bhushan
    Satyam Vyas
    Shrey Mall
    Arun Negi
    International Journal of System Assurance Engineering and Management, 2023, 14 : 2674 - 2683
  • [34] Spatial Mapping of Flood Susceptibility Using Decision Tree-Based Machine Learning Models for the Vembanad Lake System in Kerala, India
    Sundar, Parthasarathy Kulithalai Shiyam
    Kundapura, Subrahmanya
    JOURNAL OF WATER RESOURCES PLANNING AND MANAGEMENT, 2023, 149 (10)
  • [35] Classification Prediction of PM10 Concentration Using a Tree-Based Machine Learning Approach
    Shaziayani, Wan Nur
    Ul-Saufie, Ahmad Zia
    Mutalib, Sofianita
    Noor, Norazian Mohamad
    Zainordin, Nazatul Syadia
    ATMOSPHERE, 2022, 13 (04)
  • [36] Indicators of Engineering Students' Academic Performance: A Gender-Based Study
    Nagahi, Morteza
    Jaradat, Raed
    Hossain, Niamat Ullah Ibne
    Nagahisarchoghaei, Mohammad
    Elakramine, Fatine
    Georger, Simon R.
    2020 14TH ANNUAL IEEE INTERNATIONAL SYSTEMS CONFERENCE (SYSCON2020), 2020,
  • [37] Comparison of Machine Learning Tree-Based Algorithms to Predict Future Paratuberculosis ELISA Results Using Repeat Milk Tests
    Imada, Jamie
    Arango-Sabogal, Juan Carlos
    Bauman, Cathy
    Roche, Steven
    Kelton, David
    ANIMALS, 2024, 14 (07):
  • [38] Predicting Students' Performance of the Private Universities of Bangladesh using Machine Learning Approaches
    Zulfiker, Md Sabab
    Kabir, Nasrin
    Biswas, Al Amin
    Chakraborty, Partha
    Rahman, Mahfujur
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (03) : 672 - 679
  • [39] A Systematic Review on Predicting the Performance of Students in Higher Education in Offline Mode Using Machine Learning Techniques
    Rahul Rahul
    Wireless Personal Communications, 2023, 133 : 1643 - 1674
  • [40] A Systematic Review on Predicting the Performance of Students in Higher Education in Offline Mode Using Machine Learning Techniques
    Rahul
    Katarya, Rahul
    WIRELESS PERSONAL COMMUNICATIONS, 2023, 133 (03) : 1517 - 1546