Educational Data Mining Clustering Approach: Case Study of Undergraduate Student Thesis Topic

被引:2
作者
Andre
Suciati, Nanik [1 ]
Fabroyir, Hadziq [1 ]
Pardede, Eric [2 ]
机构
[1] Inst Teknol Sepuluh Nopember, Fac Intelligent Elect & Informat Technol, Dept Informat, Surabaya 60111, Indonesia
[2] La Trobe Univ, Dept Comp Sci & Informat Technol, Bundoora, Vic 3086, Australia
关键词
Computing classification system; undergraduate thesis; clustering analysis; k-means; ontology; PROCRASTINATION; RECOMMENDATION; PREDICTION; PATTERNS; COURSES;
D O I
10.1109/ACCESS.2023.3332818
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This study aims to investigate the potential of educational data mining (EDM) in addressing the issue of delayed completion in undergraduate student thesis courses. Delayed completion of these courses is a common issue that affects both students and higher education institutions. This study employed clustering analysis to create clusters of thesis topics. The research model was constructed using expert labeling to assign each thesis title to a computer science ontology standard. Cross-referencing was employed to associate supporting courses with each thesis title, resulting in a labeled dataset with three supporting courses for each thesis title. This study analyzed five different clustering algorithms, including K-Means, DBScan, BIRCH, Gaussian Mixture, and Mean Shift, to identify the best approach for analyzing undergraduate thesis data. The results demonstrated that k-means clustering is the most efficient method, generating five distinct clusters with unique characteristics. Furthermore, this study investigated the correlation between educational data, specifically GPA, and the average grades of courses that support a thesis title and the duration of thesis completion. Our investigation revealed a moderate correlation between GPA, thesis-supporting course average grades, and the time to complete the thesis, with higher academic performance being associated with shorter completion times. These moderate results indicate the need for further studies to explore additional factors beyond GPA and the average grades of thesis-supporting courses that contribute to delays in thesis completion. This study contributes to the understanding and evaluation of educational outcomes within study programs, as defined in the curriculum, particularly concerning the design and implementation of thesis topics. Additionally, the clustering results serve as a foundation for future research and offer valuable insights into the potential of EDM techniques to assist in selecting appropriate thesis topics, thereby reducing the risk of delayed completion.
引用
收藏
页码:130072 / 130088
页数:17
相关论文
共 40 条
[1]   Combination of machine learning algorithms for recommendation of courses in E-Learning System based on historical data [J].
Aher, Sunita B. ;
Lobo, L. M. R. J. .
KNOWLEDGE-BASED SYSTEMS, 2013, 51 :1-14
[2]  
Andre Andre, 2022, 2022 11th Electrical Power, Electronics, Communications, Controls and Informatics Seminar (EECCIS), P345, DOI 10.1109/EECCIS54468.2022.9902931
[3]   Student Dropout Prediction [J].
Del Bonifro, Francesca ;
Gabbrielli, Maurizio ;
Lisanti, Giuseppe ;
Zingaro, Stefano Pio .
ARTIFICIAL INTELLIGENCE IN EDUCATION (AIED 2020), PT I, 2020, 12163 :129-140
[4]  
Durairaj M., 2014, International Journal of Computer Science and Information Technologies, V5, P5987
[5]  
2020, International Journal of Computing Communications and Networking, V9, P39, DOI [10.30534/ijccn/2020/01932019, 10.30534/ijccn/2020/01932019, DOI 10.30534/IJCCN/2020/01932019]
[6]  
Efrati Valentina, 2014, Universal Access in Human-Computer Interaction. Universal Access to Information and Knowledge. 8th International Conference, UAHCI 2014, Held as Part of HCI International 2014. Proceedings: LNCS 8514, P289, DOI 10.1007/978-3-319-07440-5_27
[7]   Determining the Parameters of DBSCAN Automatically Using the Multi-Objective Genetic Algorithm [J].
Falahiazar, Zeinab ;
Bagheri, Alireza ;
Reshadi, Midia .
JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2021, 37 (01) :157-183
[8]  
International Educational Data Mining Society, 2022, about us
[9]  
Irrazábal E, 2017, 2017 XLIII LATIN AMERICAN COMPUTER CONFERENCE (CLEI)
[10]  
Jeong H., 2008, International Conference on Educational Data Mining, P127