Educational data mining: A tutorial for the rattle package in R

被引:6
作者
Bulut, Okan [1 ]
Yavuz, Hatice Cigdem [2 ]
机构
[1] Univ Alberta, Ctr Res Appl Measurement & Evaluat, Edmonton, AB, Canada
[2] Cukurova Univ, Fac Educ, Saricam Adana, Turkey
关键词
Educational data mining; rattle; Decision tree; Random forest; Support vector machines; CLASSIFICATION; ANALYTICS;
D O I
10.21449/ijate.627361
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
Educational data mining (EDM) has been a rapidly growing research field over the last decade and enabled researchers to discover patterns and trends in education with more sophisticated methods. EDM offers promising solutions to complex educational problems. Given the rapid increase in the availability of big data in education and software programs to analyze big data, the demand for user-friendly, free software programs to implement EDM methods also continues to increase. The R programming language has become a popular environment for data mining due to its availability and flexibility. The rattle package in R contains a set of functions to implement data mining with a graphical user interface. This study demonstrates three widely used data mining algorithms (classification and regression tree, random forest, and support vector machine) in EDM using real data from the 2015 administration of the Programme for International Student Assessment (PISA). First, a brief introduction to EDM is provided along with the description of the selected data mining algorithms. Then, how to perform data mining analysis using the rattle's graphical user interface is demonstrated. The study concludes by comparing the results of the selected data mining algorithms and highlighting how those algorithms can be utilized in the context of educational research.
引用
收藏
页码:20 / 36
页数:17
相关论文
共 30 条
[1]  
Agarwal S., 2012, International Journal of e-Education, e-Business, e-Management and e-Learning, DOI DOI 10.7763/IJEEEE.2012.V2.97
[2]   Educational data mining and learning analytics for 21st century higher education: A review and synthesis [J].
Aldowah, Hanan ;
Al-Samarraie, Hosam ;
Fauzy, Wan Mohamad .
TELEMATICS AND INFORMATICS, 2019, 37 :13-49
[3]  
Aulck L., 2016, ARXIV160606364
[4]  
Baker R. S., 2017, WILEY HDB COGNITION, P379
[5]   Educational Data Mining and Learning Analytics: Applications to Constructionist Research [J].
Berland, Matthew ;
Baker, Ryan S. ;
Blikstein, Paulo .
TECHNOLOGY KNOWLEDGE AND LEARNING, 2014, 19 (1-2) :205-220
[6]   SmcHD1, containing a structural-maintenance-of-chromosomes hinge domain, has a critical role in X inactivation [J].
Blewitt, Marnie E. ;
Gendrel, Anne-Valerie ;
Pang, Zhenyi ;
Sparrow, Duncan B. ;
Whitelaw, Nadia ;
Craig, Jeffrey M. ;
Apedaile, Anwyn ;
Hilton, Douglas J. ;
Dunwoodie, Sally L. ;
Brockdorff, Neil ;
Kay, Graham F. ;
Whitelaw, Emma .
NATURE GENETICS, 2008, 40 (05) :663-669
[7]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[8]  
CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411
[9]   Educational Big Data Mining: How to Enhance Virtual Learning Environments [J].
Ducange, Pietro ;
Pecori, Riccardo ;
Sarti, Luigi ;
Vecchio, Massimo .
INTERNATIONAL JOINT CONFERENCE SOCO'16- CISIS'16-ICEUTE'16, 2017, 527 :681-690
[10]   A Systematic Review on Educational Data Mining [J].
Dutti, Ashish ;
Ismaili, Maizatul Akmar ;
Herawani, Tutut .
IEEE ACCESS, 2017, 5 :15991-16005