Student Engagement Dataset (SED): An Online Learning Activity Dataset

被引:0
作者
Kassim, M. S. S. [1 ]
Azizul, Z. H. [1 ]
Ahmad Fuaad, A. A. H. [2 ,3 ]
机构
[1] Univ Malaya, Fac Comp Sci & Informat Technol, Dept Artificial Intelligence, Kuala Lumpur 50603, Malaysia
[2] Univ Malaya, Fac Sci, Dept Chem, Kuala Lumpur 50603, Malaysia
[3] Univ Malaya, Acad Strateg Planning Dept, Digital Learning Div, Kuala Lumpur 50603, Malaysia
来源
IEEE ACCESS | 2025年 / 13卷
关键词
Learning analytics; learning management systems (LMSs); online learning; virtual learning environments (VLEs); student engagement; ANALYTICS;
D O I
10.1109/ACCESS.2025.3531102
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Distance learning has become a popular educational medium, and the Internet has spread since the early 2000s. To leverage this phenomenon, learning analytics and data mining can provide insights into improving pedagogy and assessing student engagement. To this end, a student-centric dataset was constructed by extracting data from Universiti Malaya's Moodle-based Virtual Learning Environment (VLE), which serves approximately 25,000 students annually. In this paper, we present the Student Engagement Dataset (SED). The dataset consists of 16,609 students and 2,407 courses. It contains information such as grades and daily logged online activities (approximately 12 million data points), including temporal data across four tables. The tables include student engagement features created by aggregating raw activity data. Here, we present the dataset's properties and describe the data collection, selection, and processing steps. Correlation analysis of student engagement features showed a statistically significant but weak negative correlation between the number of courses, early morning logins, assignments, and top students' performance. SED is expected to present new opportunities for researchers in the learning analytics domain.
引用
收藏
页码:23607 / 23617
页数:11
相关论文
共 22 条
  • [1] Can we predict success from log data in VLEs? Classification of interactions for learning analytics and their relation with performance in VLE-supported F2F and online learning
    Agudo-Peregrina, Angel F.
    Iglesias-Pradas, Santiago
    Angel Conde-Gonzalez, Miguel
    Hernandez-Garcia, Angel
    [J]. COMPUTERS IN HUMAN BEHAVIOR, 2014, 31 : 542 - 550
  • [2] Students Performance Prediction in Online Courses Using Machine Learning Algorithms
    Alshabandar, Raghad
    Hussain, Abir
    Keight, Robert
    Khan, Wasiq
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [3] Predicting Student Grades Based on Their Usage of LMS Moodle Using Petri Nets
    Balogh, Zoltan
    Kucharik, Michal
    [J]. APPLIED SCIENCES-BASEL, 2019, 9 (20):
  • [4] Bognár L, 2020, INT CONF COGN INFO, P405, DOI [10.1109/coginfocom50765.2020.9237894, 10.1109/CogInfoCom50765.2020.9237894]
  • [5] Conijn Rianne, 2017, IEEE Transactions on Learning Technologies, V10, P17, DOI 10.1109/TLT.2016.2616312
  • [6] Transitioning to the "new normal" of learning in unpredictable times: pedagogical practices and learning performance in fully online flipped classrooms
    Hew, Khe Foon
    Jia, Chengyuan
    Gonda, Donn Emmanuel
    Bai, Shurui
    [J]. INTERNATIONAL JOURNAL OF EDUCATIONAL TECHNOLOGY IN HIGHER EDUCATION, 2020, 17 (01)
  • [7] Open University Learning Analytics dataset
    Kuzilek, Jakub
    Hlosta, Martin
    Zdrahal, Zdenek
    [J]. SCIENTIFIC DATA, 2017, 4
  • [8] Towards Portability of Models for Predicting Students' Final Performance in University Courses Starting from Moodle Logs
    Lopez-Zambrano, Javier
    Lara, Juan A.
    Romero, Cristobal
    [J]. APPLIED SCIENCES-BASEL, 2020, 10 (01):
  • [9] Means B., 2020, SUDDENLY ONLINE NATL
  • [10] Combining University Student Self-Regulated Learning Indicators and Engagement with Online Learning Events to Predict Academic Performance
    Pardo, Abelardo
    Han, Feifei
    Ellis, Robert A.
    [J]. IEEE TRANSACTIONS ON LEARNING TECHNOLOGIES, 2017, 10 (01): : 82 - 92