Comparative Analysis of Prediction Techniques to Determine Student Dropout: Logistic Regression vs Decision Trees

Cited by: 0
Authors
Perez, Alfredo [1]
Grandon, Elizabeth E. [1]
Caniupan, Monica [1]
Vargas, Gilda [2]
Affiliations
[1] Univ Bio Bio, Dept Sistemas Informac, Concepcion, Chile
[2] Univ Bio Bio, Dept Estadist, Concepcion, Chile
Source
2018 37TH INTERNATIONAL CONFERENCE OF THE CHILEAN COMPUTER SCIENCE SOCIETY (SCCC) | 2018
Keywords
Student Dropout; Data Mining; SAP; Predictive Analytics; Logistic Regression; Decision Trees; HIGHER EDUCATION;
DOI
Not available
Chinese Library Classification number
TP301 [Theory, Methods];
Subject classification code
081202;
Abstract
Detecting students who may drop out of an academic program is a relevant issue for universities, which has prompted efforts to examine the variables that determine student dropout. Dropout is defined in different ways; however, the studies agree that a student drops out of a course of study only when several variables combine. This study compares the performance indicators of the current dropout model of the Universidad del Bio-Bio (UBB), which is based on logistic regression, with those of a new model based on decision trees. The new model was obtained through data mining methodologies and implemented with the SAP Predictive Analytics tool. Real data from the UBB databases were used to train, validate, and apply the model. The comparison shows that the proposed model predicts student dropout with an accuracy of 86%, a precision of 97%, and an error rate of 14%, better indicators than those currently delivered by the model based on logistic regression. The prediction model was subsequently optimized by considering other variables, which further improved the prediction indicators. Higher education institutions should take into account the variables that best explain student dropout in order to improve student retention.
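The three indicators named in the abstract (accuracy, precision, error rate) are conventionally derived from a binary confusion matrix, and the reported 86% accuracy and 14% error rate are consistent with error rate = 1 - accuracy. The following is a minimal sketch of those standard definitions, not the authors' implementation. The class name, function names, and counts are hypothetical and chosen only for illustration; they are not data from the UBB study.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConfusionCounts:
    """Counts for a binary dropout classifier (hypothetical helper type)."""
    tp: int  # predicted dropout, student did drop out
    fp: int  # predicted dropout, student stayed
    tn: int  # predicted stay, student stayed
    fn: int  # predicted stay, student did drop out

    @property
    def total(self) -> int:
        return self.tp + self.fp + self.tn + self.fn


def accuracy(c: ConfusionCounts) -> float:
    """Share of all students classified correctly."""
    return (c.tp + c.tn) / c.total


def precision(c: ConfusionCounts) -> float:
    """Share of predicted dropouts that really dropped out."""
    predicted_positive = c.tp + c.fp
    if predicted_positive == 0:
        raise ValueError("precision is undefined with no positive predictions")
    return c.tp / predicted_positive


def error_rate(c: ConfusionCounts) -> float:
    """Share of all students classified incorrectly (1 - accuracy)."""
    return (c.fp + c.fn) / c.total


# Illustrative counts only; NOT results from the paper.
counts = ConfusionCounts(tp=30, fp=10, tn=50, fn=10)

if __name__ == "__main__":
    print(f"accuracy   = {accuracy(counts):.2f}")
    print(f"precision  = {precision(counts):.2f}")
    print(f"error rate = {error_rate(counts):.2f}")
```

Computing the same three functions on the confusion matrices of two models, for example a logistic regression and a decision tree evaluated on the same held-out data, gives a like-for-like comparison of the kind the abstract describes.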
Pages: 8
Related papers
50 in total
  • [1] COMPARATIVE STUDY OF BIODEGRADABILITY PREDICTION OF CHEMICALS USING DECISION TREES, FUNCTIONAL TREES, AND LOGISTIC REGRESSION
    Chen, Guangchao
    Li, Xuehua
    Chen, Jingwen
    Zhang, Ya-nan
    Peijnenburg, Willie J. G. M.
    ENVIRONMENTAL TOXICOLOGY AND CHEMISTRY, 2014, 33 (12) : 2688 - 2693
  • [2] Student Dropout Model Based on Logistic Regression
    Cuji Chacha, Blanca Rocio
    Gavilanes Lopez, Wilma Lorena
    Vicente Guerrero, Victor Xavier
    Villacis Villacis, Wilma Guadalupe
    APPLIED TECHNOLOGIES (ICAT 2019), PT II, 2020, 1194 : 321 - 333
  • [3] PREDICTION OF CANNABIS AND COCAINE USE IN ADOLESCENCE USING DECISION TREES AND LOGISTIC REGRESSION
    Gervilla, Elena
    Palmer, Alfonso
    EUROPEAN JOURNAL OF PSYCHOLOGY APPLIED TO LEGAL CONTEXT, 2010, 2 (01) : 19 - 35
  • [4] Aplication of Decision Trees for Detection of Student Dropout Profiles
    Timaran Pereira, Ricardo
    Caicedo Zambrano, Javier
    2017 16TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2017, : 528 - 531
  • [5] Analysis of Customer Churn Prediction in Telecom Industry using Decision Trees and Logistic Regression
    Dalvi, Preeti K.
    Khandge, Siddhi K.
    Deomore, Ashish
    Bankar, Aditya
    Kanade, V. A.
    2016 SYMPOSIUM ON COLOSSAL DATA ANALYSIS AND NETWORKING (CDAN), 2016,
  • [6] Comparative Analysis of Decision Trees with Logistic Regression in Predicting Fault-Prone Classes
    Singh, Yogesh
    Takkar, Arvinder Kaur
    Malhotra, Ruchika
    INFORMATION SYSTEMS, TECHNOLOGY AND MANAGEMENT-THIRD INTERNATIONAL CONFERENCE, ICISTM 2009, 2009, 31 : 337 - 338
  • [7] Prediction Accuracy Analysis with Logistic Regression and CART Decision Tree
    Zhang, Xudong
    Wang, Di
    Qian, Ying
    Yang, Yingming
    FOURTH INTERNATIONAL WORKSHOP ON PATTERN RECOGNITION, 2019, 11198
  • [8] Modelling speech emotion recognition using logistic regression and decision trees
    Jacob A.
    International Journal of Speech Technology, 2017, 20 (4) : 897 - 905
  • [9] Landslide Susceptibility Assessment Using Bagging Ensemble Based Alternating Decision Trees, Logistic Regression and J48 Decision Trees Methods: A Comparative Study
    Pham B.T.
    Tien Bui D.
    Prakash I.
    Geotechnical and Geological Engineering, 2017, 35 (6) : 2597 - 2611
  • [10] A new hybrid classification algorithm for customer churn prediction based on logistic regression and decision trees
    De Caigny, Arno
    Coussement, Kristof
    De Bock, Koen W.
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2018, 269 (02) : 760 - 772