MINING STUDENT'S ADMISSION DATA AND PREDICTING STUDENT'S PERFORMANCE USING DECISION TREES

被引:0
|
作者
Asif, R. [1 ]
Merceron, A.
Pathan, M. K. [1 ]
机构
[1] NED Univ Engn & Technol, Karachi, Pakistan
来源
5TH INTERNATIONAL CONFERENCE OF EDUCATION, RESEARCH AND INNOVATION (ICERI 2012) | 2012年
关键词
K-means clustering; decision trees; predicting performance;
D O I
暂无
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
The purpose of data mining is to find out new and possibly useful information from huge amounts of data. Data mining techniques are useful in many application areas like fraud detection, businesses, banking and telecommunications. Educational Data Mining is the application of Data Mining Techniques to educational data. Quality Assurance in education has compelled academia to constantly explore ways to improve overall educational processes. This has led to increasing interest in educational data mining. This paper is a first attempt to retrieve pedagogical information from the data of a public sector engineering university in Pakistan. The data mining techniques are used on the educational data of the undergraduate students in order to predict the performance of students. The study is planned around three research question: Can the students' college marks be used to predict their performance at the undergraduate education? Is the discipline in which they are enrolled significant in predicting their performance? Is any one particular year out of their four years undergraduate studies more decisive than the rest in predicting their performance? To answer these questions, data mining algorithms were applied to identify patterns in the available historical data. The students' marks at college level were examined and mined using the k-means clustering algorithm. The findings revealed a strong correlation in the students' college marks and their marks in individual subjects particularly in Maths, Physics and Chemistry at college level, however, no significant correlation was found between the students' college marks and their overall performance in the undergraduate programme. So the first question is answered negatively which is in agreement with results of different studies conducted in other countries. This analysis suggests that students' performance at university level might be based on the learning and teaching methods of university. The result of clustering pointed out that discipline should be taken into account to predict performance. The application of different decision trees, as classification algorithms, to the examination marks of students from different years of their current degree programme in order to predict their academic achievement in their final year examination indicates that performance in first and second year has a considerably decisive impact in predicting students' final year performance. The study carries important implication for the academic institutions by helping them in providing assistance to students to improve their academic skills at the appropriate level and time.
引用
收藏
页码:5121 / 5129
页数:9
相关论文
共 50 条
  • [41] Assessment of the Risk Factors of Coronary Heart Events Based on Data Mining With Decision Trees
    Karaolis, Minas A.
    Moutiris, Joseph A.
    Hadjipanayi, Demetra
    Pattichis, Constantinos S.
    IEEE TRANSACTIONS ON INFORMATION TECHNOLOGY IN BIOMEDICINE, 2010, 14 (03): : 559 - 566
  • [42] Multi-GPU approach for big data mining - global induction of decision trees
    Jurczuk, Krzysztof
    Czajkowski, Marcin
    Kretowski, Marek
    PROCEEDINGS OF THE 2019 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION (GECCCO'19 COMPANION), 2019, : 175 - 176
  • [43] Analysis of NIR spectroscopic data using decision trees and their ensembles
    Kucheryavskiy S.
    Journal of Analysis and Testing, 2018, 2 (3) : 274 - 289
  • [44] Utilizing longitudinal data to build decision trees for profile building and predicting eating behavior
    Spanakis, Gerasimos
    Weiss, Gerhard
    Boh, Bastiaan
    Kerkhofs, Vincent
    Roefs, Anne
    INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS/INTERNATIONAL CONFERENCE ON PROJECT MANAGEMENT/INTERNATIONAL CONFERENCE ON HEALTH AND SOCIAL CARE INFORMATION SYSTEMS AND TECHNOLOGIES, CENTERIS/PROJMAN / HCIST 2016, 2016, 100 : 782 - 789
  • [45] PERFORMANCE INDICATOR SELECTION USING DECISION TREES IN ELITE HANDBALL
    Cabrera Quercini, I
    Gonzalez-Ramirez, A.
    Garcia Tormo, J., V
    Martinez, I
    REVISTA INTERNACIONAL DE MEDICINA Y CIENCIAS DE LA ACTIVIDAD FISICA Y DEL DEPORTE, 2022, 22 (88): : 753 - 764
  • [46] Inductive data mining: automatic generation of decision trees from data for QSAR modelling and process historical data analysis
    Ma, Chao Y.
    Buontempo, Frances V.
    Wang, Xue Z.
    INTERNATIONAL JOURNAL OF MODELLING IDENTIFICATION AND CONTROL, 2011, 12 (1-2) : 101 - 106
  • [47] Inductive Data Mining: Automatic Generation of Decision Trees from Data for QSAR Modelling and Process Historical Data Analysis
    Ma, Chao Y.
    Buontempo, Frances V.
    Wang, Xue Z.
    18TH EUROPEAN SYMPOSIUM ON COMPUTER AIDED PROCESS ENGINEERING, 2008, 25 : 581 - 586
  • [48] Predicting Mortality in Patients with Stroke Using Data Mining Techniques
    Hadianfard, Zahra
    Afshar, Hadi Lotfnezhad
    Nazarbaghi, Surena
    Rahimi, Bahlol
    Timpka, Toomas
    ACTA INFORMATICA PRAGENSIA, 2022, 11 (01) : 36 - 47
  • [49] Multiple Early-Termination Scheme for TZSearch Algorithm based on Data Mining and Decision Trees
    Goncalves, Paulo
    Correa, Guilherme
    Porto, Marcelo
    Zatt, Bruno
    Agostini, Luciano
    2017 IEEE 19TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2017,
  • [50] Inductive data mining based on genetic programming: Automatic generation of decision trees from data for process historical data analysis
    Ma, Chao Y.
    Wang, Xue Z.
    COMPUTERS & CHEMICAL ENGINEERING, 2009, 33 (10) : 1602 - 1616