Comparative Analysis of Prediction Techniques to Determine Student Dropout: Logistic Regression vs Decision Trees

被引:0
|
作者
Perez, Alfredo [1 ]
Grandon, Elizabeth E. [1 ]
Caniupan, Monica [1 ]
Vargas, Gilda [2 ]
机构
[1] Univ Bio Bio, Dept Sistemas Informac, Concepcion, Chile
[2] Univ Bio Bio, Dept Estadist, Concepcion, Chile
来源
2018 37TH INTERNATIONAL CONFERENCE OF THE CHILEAN COMPUTER SCIENCE SOCIETY (SCCC) | 2018年
关键词
Student Dropout; Data Mining; SAP; Predictive Analytics; Logistic Regression; Decision Trees; HIGHER EDUCATION;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Currently, the detection of students who may drop out from an academic program is a relevant issue for universities, so there are efforts to examine the variables that determine students' drop out. Drop out is defined in different ways, however, all the studies converge in that for a student to drop out a course of study, some variables must be combined. This study presents a comparison of performance indicators of the current drop out model of the Universidad del Bio-Bio (UBB), which is based on logistic regression technique and it is compared with a new model based on decision trees. The new model is obtained through data mining methodologies and it was implemented through the SAP Predictive Analytics tool. To train, validate, and apply the model, real data from the UBB databases were used. The comparison shows that the prediction of student' drop out of the proposed model obtains an accuracy of 86%, a precision of 97% with an error rate of 14%, better indicators than the current values delivered by the model based on logistic regression. Subsequently, the prediction model obtained was optimized considering other variables, improving even more the prediction indicators. Higher education institutions should take into account the variables that explain the most the phenomenon of student's drop out to improve the retention of their students.
引用
收藏
页数:8
相关论文
共 50 条
  • [21] Cow Health Prediction Method Based on Logistic Regression and Decision Tree
    Nie, Jiaxin
    Fang, Jiandong
    Zhao, Yvdong
    2022 34TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2022, : 3712 - 3717
  • [22] Tree induction vs. logistic regression: A learning-curve analysis
    Perlich, C
    Provost, F
    Simonoff, JS
    JOURNAL OF MACHINE LEARNING RESEARCH, 2004, 4 (02) : 211 - 255
  • [23] Comparative Analysis of Dropout and Student Permanence in Rural Higher Education
    Guzman, Alfredo
    Barragan, Sandra
    Cala-Vitery, Favio
    SUSTAINABILITY, 2022, 14 (14)
  • [24] An investigation of the factors influencing cost system functionality using decision trees, support vector machines and logistic regression
    Kuzey, Cemil
    Uyar, Ali
    Delen, Dursun
    INTERNATIONAL JOURNAL OF ACCOUNTING AND INFORMATION MANAGEMENT, 2019, 27 (01) : 27 - 55
  • [25] Financial prediction: Application of Logistic Regression with factor Analysis
    Han Dongmei
    Ma Liqun
    Yu Changrui
    2008 4TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-31, 2008, : 9893 - 9896
  • [26] The student support system in mediating work-related dropout: a comparative analysis of four worlds of student funding
    Kalalahti, Mira
    Poder, Kaire
    Lauri, Triin
    Hemila, Pepita
    Skuciene, Daiva
    STUDIES IN HIGHER EDUCATION, 2025,
  • [27] Predicting Student-Teachers Dropout Risk and Early Identification: A Four-Step Logistic Regression Approach
    Singh, Harman Preet
    Alhulail, Hilal Nafil
    IEEE ACCESS, 2022, 10 : 6470 - 6482
  • [28] A comparative analysis of methods for pruning decision trees
    Esposito, F
    Malerba, D
    Semeraro, G
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1997, 19 (05) : 476 - 491
  • [29] Comparative Analysis of Logistic Regression, Gradient Boosted Trees, SVM, and Random Forest Algorithms for Prediction of Acute Kidney Injury Requiring Dialysis After Cardiac Surgery
    Omar, Evi Diana
    Mat, Hasnah
    Abd Karim, Ainil Zafirah
    Sanaudi, Ridwan
    Ibrahim, Fairol H.
    Omar, Mohd Azahadi
    Ismail, Muhd Zulfadli Hafiz
    Jayaraj, Vivek Jason
    Goh, Bak Leong
    INTERNATIONAL JOURNAL OF NEPHROLOGY AND RENOVASCULAR DISEASE, 2024, 17 : 197 - 204
  • [30] Classification of Daily Body Weight Gains in Beef Calves Using Decision Trees, Artificial Neural Networks, and Logistic Regression
    Grzesiak, Wilhelm
    Zaborski, Daniel
    Pilarczyk, Renata
    Wojcik, Jerzy
    Adamczyk, Krzysztof
    ANIMALS, 2023, 13 (12):