Comparative Analysis of Prediction Techniques to Determine Student Dropout: Logistic Regression vs Decision Trees

Cited by: 0
Authors
Perez, Alfredo [1]
Grandon, Elizabeth E. [1]
Caniupan, Monica [1]
Vargas, Gilda [2]
Affiliations
[1] Univ Bio Bio, Dept Sistemas Informac, Concepcion, Chile
[2] Univ Bio Bio, Dept Estadist, Concepcion, Chile
Source
2018 37TH INTERNATIONAL CONFERENCE OF THE CHILEAN COMPUTER SCIENCE SOCIETY (SCCC) | 2018
Keywords
Student Dropout; Data Mining; SAP; Predictive Analytics; Logistic Regression; Decision Trees; Higher Education
DOI
Not available
Chinese Library Classification
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Currently, detecting students who may drop out of an academic program is a relevant issue for universities, and there are efforts to examine the variables that determine student dropout. Dropout is defined in different ways; however, the studies converge in that, for a student to drop out of a course of study, several variables must combine. This study compares performance indicators of the current dropout model of the Universidad del Bio-Bio (UBB), which is based on logistic regression, with those of a new model based on decision trees. The new model was obtained through data mining methodologies and implemented with the SAP Predictive Analytics tool. To train, validate, and apply the model, real data from the UBB databases were used. The comparison shows that the proposed model predicts student dropout with an accuracy of 86% and a precision of 97%, with an error rate of 14%, better indicators than those currently delivered by the model based on logistic regression. Subsequently, the prediction model was optimized by considering other variables, which further improved the prediction indicators. Higher education institutions should take into account the variables that best explain student dropout in order to improve student retention.
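The three indicators quoted in the abstract (accuracy, precision, error rate) are commonly derived from a binary confusion matrix. The sketch below shows that arithmetic under the usual definitions, which the abstract does not spell out. The class name, function names, and the four counts are hypothetical: the counts were chosen only so that the arithmetic lands on 86%, 97%, and 14%, and they are not data from the UBB study.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConfusionCounts:
    """Counts for a binary classifier where 'positive' means 'predicted dropout'."""
    tp: int  # dropouts correctly predicted as dropouts
    fp: int  # non-dropouts wrongly predicted as dropouts
    fn: int  # dropouts wrongly predicted as non-dropouts
    tn: int  # non-dropouts correctly predicted as non-dropouts

    @property
    def total(self) -> int:
        return self.tp + self.fp + self.fn + self.tn


def accuracy(c: ConfusionCounts) -> float:
    """Share of all predictions that are correct."""
    return (c.tp + c.tn) / c.total


def precision(c: ConfusionCounts) -> float:
    """Share of predicted dropouts that really dropped out."""
    return c.tp / (c.tp + c.fp)


def error_rate(c: ConfusionCounts) -> float:
    """Share of all predictions that are wrong (the complement of accuracy)."""
    return (c.fp + c.fn) / c.total


# Hypothetical counts, NOT taken from the paper; illustration only.
example = ConfusionCounts(tp=97, fp=3, fn=25, tn=75)

if __name__ == "__main__":
    print(f"accuracy   = {accuracy(example):.2f}")
    print(f"precision  = {precision(example):.2f}")
    print(f"error rate = {error_rate(example):.2f}")
```

Under these definitions, accuracy and error rate always sum to 1, which is consistent with the 86% and 14% reported in the abstract. High precision with lower accuracy, as in this illustration, means that few students are flagged wrongly while some actual dropouts are missed.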
Pages: 8