Predicting GPA of University Students with Supervised Regression Machine Learning Models

Cited by: 3
Authors
Falat, Lukas [1]
Piscova, Terezia [1]
Affiliations
[1] Univ Zilina, Fac Management Sci & Informat, Univ 8215-1, Zilina 01026, Slovakia
Source
APPLIED SCIENCES-BASEL | 2022 / Vol. 12 / Issue 17
Keywords
machine learning; prediction; statistical modeling; education; GPA; random forest; linear regression; student; ACADEMIC-SUCCESS; SCIENCE; SUPPORT; SCHOOL;
DOI
10.3390/app12178403
Chinese Library Classification number
O6 [Chemistry];
Subject classification code
0703 ;
Abstract
The paper deals with predicting grade point average (GPA) with supervised machine learning models. Based on the literature review, we divide the factors influencing the GPA into three groups: psychological, sociological and study factors. Data from the questionnaire are evaluated using statistical analysis. We use confirmatory data analysis, where we compare the answers of men and women, of university students coming from grammar schools versus those coming from secondary vocational schools, and of students divided according to the average grade. The differences between groups are tested with the Shapiro-Wilk test and the Mann-Whitney U test. We identify the factors influencing the GPA through correlation analysis, where we use the Pearson test and ANOVA. Based on the performed analysis, factors that show a statistically significant dependence with the GPA are identified. Subsequently, we implement supervised machine learning models. We create 10 prediction models using linear regression, decision trees and random forest. The models predict the GPA based on independent variables. Based on the MAPE metric on the five validation sets in cross-validation, the best generalization accuracy is achieved by a random forest model, with an average MAPE of 11.13%. Therefore, we recommend the use of a random forest as a starting model for modeling student results.
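The evaluation scheme named in the abstract (average MAPE over five validation sets) can be illustrated with a minimal sketch. This is not the authors' code: the one-predictor least-squares model stands in for their linear regression, decision tree and random forest models, the interleaved fold assignment is an assumption, and the `hours`/`gpa` values are hypothetical toy data, not taken from the paper.

```python
from statistics import mean


def mape(y_true, y_pred):
    """Mean absolute percentage error in percent (assumes no true value is zero)."""
    return 100.0 * mean(abs((t - p) / t) for t, p in zip(y_true, y_pred))


def fit_line(xs, ys):
    """Ordinary least squares with one predictor; returns (slope, intercept)."""
    mx, my = mean(xs), mean(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, my - slope * mx


def cross_validated_mape(xs, ys, k=5):
    """Average validation MAPE over k folds.

    Fold i holds out every k-th row starting at row i (an assumed split;
    the paper does not state how its folds were formed).
    """
    scores = []
    for fold in range(k):
        train = [i for i in range(len(xs)) if i % k != fold]
        valid = [i for i in range(len(xs)) if i % k == fold]
        slope, intercept = fit_line([xs[i] for i in train], [ys[i] for i in train])
        preds = [slope * xs[i] + intercept for i in valid]
        scores.append(mape([ys[i] for i in valid], preds))
    return mean(scores)


# Hypothetical toy data: weekly study hours vs. GPA (lower grade = better).
hours = [2, 4, 5, 7, 8, 10, 12, 13, 15, 16]
gpa = [3.4, 3.1, 3.0, 2.7, 2.6, 2.3, 2.0, 1.9, 1.6, 1.5]

avg_mape = cross_validated_mape(hours, gpa, k=5)
print(f"average validation MAPE: {avg_mape:.2f}%")
```

Comparing models under this scheme amounts to swapping `fit_line` for another estimator and keeping the folds fixed, so that each model is scored on the same five validation sets.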
Pages: 25
Related papers
50 in total
  • [21] Predicting soot formation in fossil fuels: A comparative study of regression and machine learning models
    Lawal, Ridhwan
    Farooq, Wasif
    Abdulraheem, Abdulazeez
    Jameel, Abdul Gani Abdul
    DIGITAL CHEMICAL ENGINEERING, 2024, 12
  • [22] Potential of Machine Learning for Predicting Sleep Disorders: A Comprehensive Analysis of Regression and Classification Models
    Alazaidah, Raed
    Samara, Ghassan
    Aljaidi, Mohammad
    Haj Qasem, Mais
    Alsarhan, Ayoub
    Alshammari, Mohammed
    DIAGNOSTICS, 2024, 14 (01)
  • [23] Machine Learning Algorithms Outperform Conventional Regression Models in Predicting Development of Hepatocellular Carcinoma
    Singal, Amit G.
    Mukherjee, Ashin
    Elmunzer, B. Joseph
    Higgins, Peter D. R.
    Lok, Anna S.
    Zhu, Ji
    Marrero, Jorge A.
    Waljee, Akbar K.
    AMERICAN JOURNAL OF GASTROENTEROLOGY, 2013, 108 (11) : 1723 - 1730
  • [24] Predicting Ozone Pollution in Urban Areas Using Machine Learning and Quantile Regression Models
    Cueva, Fernando
    Saquicela, Victor
    Sarmiento, Juan
    Cabrera, Fanny
    INFORMATION AND COMMUNICATION TECHNOLOGIES (TICEC 2021), 2021, 1456 : 281 - 296
  • [25] Machine learning regression models and associated variables for predicting preoperative anxiety in paediatric patients
    Rahman, Abdul
    Sheikh, Javed Khan
    Raeen, Mohammad Sarwar
    INDIAN JOURNAL OF ANAESTHESIA, 2025, 69 (02) : 248 - 248
  • [26] Predicting Academic Performance of University Students Using Machine Learning: A Case Study in the UK
    Soyoye, Titilayo Olabisi
    Chen, Tianhua
    Hill, Richard
    Mccabe, Keith
    2023 IEEE INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY, WI-IAT, 2023 : 431 - 434
  • [27] Predicting the Effectiveness of a Mindfulness Virtual Community Intervention for University Students: Machine Learning Model
    El Morr, Christo
    Tavangar, Farideh
    Ahmad, Farah
    Ritvo, Paul
    INTERACTIVE JOURNAL OF MEDICAL RESEARCH, 2024, 13
  • [28] Predicting online gambling self-exclusion: an analysis of the performance of supervised machine learning models
    Percy, Christian
    Franca, Manoel
    Dragicevic, Simo
    Garcez, Artur d'Avila
    INTERNATIONAL GAMBLING STUDIES, 2016, 16 (02) : 193 - 210
  • [29] Predicting students' continuance use of learning management system at a technical university using machine learning algorithms
    Kuadey, Noble Arden
    Mahama, Francois
    Ankora, Carlos
    Bensah, Lily
    Maale, Gerald Tietaa
    Agbesi, Victor Kwaku
    Kuadey, Anthony Mawuena
    Adjei, Laurene
    INTERACTIVE TECHNOLOGY AND SMART EDUCATION, 2023, 20 (02) : 209 - 227
  • [30] Predicting Students Performance Using Supervised Machine Learning Based on Imbalanced Dataset and Wrapper Feature Selection
    Alija S.
    Beqiri E.
    Gaafar A.S.
    Hamoud A.K.
    Informatica (Slovenia), 2023, 47 (01) : 11 - 20