Behind the scenes of educational data mining

Times cited: 10
Authors
Feldman-Maggor, Yael [1, 2]
Barhoom, Sagiv [2, 3]
Blonder, Ron [1]
Tuvi-Arad, Inbal [2]
Affiliations
[1] Weizmann Inst Sci, Dept Sci Teaching, Rehovot, Israel
[2] Open Univ Israel, Dept Nat Sci, Raanana, Israel
[3] Open Univ Israel, Dept Informat Syst, Ctr Comp, Raanana, Israel
Keywords
Learning analytics; Educational data mining; Data pre-processing; Learning management system (LMS); Moodle; Higher education
DOI
10.1007/s10639-020-10309-x
Chinese Library Classification number
G40 [Education]
Discipline classification code
040101; 120403
Abstract
Research based on educational data mining conducted at academic institutions is often limited by institutional policy regarding the type of learning management system and the level of detail in its activity reports. Researchers often have access only to raw data. Such data normally contain numerous fictitious user activities that can bias the activity trends and lead to inaccurate conclusions unless careful strategies for data cleaning, filtering, and indexing are applied. In addition, pre-processing phases are not always reported in detail in the scientific literature. As educational data mining and learning analytics methodologies become increasingly popular in educational research, it is important to raise awareness among researchers and educational policymakers of the pre-processing phase, which is essential for creating a reliable database prior to any analysis. This phase can be divided into four consecutive pre-processing stages: data gathering, data interpretation, database creation, and data organization. Taken together, these stages stress the technical and cooperative nature of this type of research, and the need for careful interpretation of the studied parameters. To illustrate these aspects, we applied these stages to online educational data collected from several chemistry courses conducted at two academic institutions. Our results show that adequate pre-processing of the data can prevent major inaccuracies in the research findings, and significantly increase the authenticity and reliability of the conclusions.
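The filtering step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' pipeline: the log layout, the role labels, and the assumption that "fictitious" activity means entries generated by non-student accounts (for example staff or test users) are all hypothetical and introduced only for illustration.

```python
from collections import Counter

# Hypothetical raw LMS log rows: (user_id, role, action).
# The field layout and role names are assumptions, not taken from the paper.
RAW_LOG = [
    ("s01", "student", "view_forum"),
    ("s01", "student", "view_video"),
    ("t01", "teacher", "view_forum"),
    ("admin", "test_user", "view_video"),
    ("s02", "student", "view_forum"),
]


def clean_log(rows, keep_roles=("student",)):
    """Keep only rows whose role is listed in keep_roles."""
    return [row for row in rows if row[1] in keep_roles]


def activity_counts(rows):
    """Count log entries per action type."""
    return Counter(action for _, _, action in rows)


raw_counts = activity_counts(RAW_LOG)
clean_counts = activity_counts(clean_log(RAW_LOG))

# Compare activity totals before and after filtering.
for action in sorted(raw_counts):
    print(action, raw_counts[action], clean_counts[action])
```

In this toy log, the unfiltered counts overstate every activity because staff and test accounts are counted alongside students, which is the kind of bias the abstract warns about.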
Pages: 1455-1470
Number of pages: 16