Data Augmentation with GAN to Improve the Prediction of At-Risk Students in a Virtual Learning Environment

被引:0
|
作者
Volaric, Tomislav [1 ]
Ljubic, Hrvoje [1 ]
Dominkovic, Marija [1 ]
Martinovic, Goran [2 ]
Rozic, Robert [1 ]
机构
[1] Univ Mostar, Trg Hrvatskih Velikana 1, Mostar, Bosnia & Herceg
[2] Josip Juraj Strossmayer Univ Osijek, Fac Elect Engn Comp Sci & Informat Technol, Osijek, Croatia
来源
ARTIFICIAL INTELLIGENCE IN EDUCATION. POSTERS AND LATE BREAKING RESULTS, WORKSHOPS AND TUTORIALS, INDUSTRY AND INNOVATION TRACKS, PRACTITIONERS, DOCTORAL CONSORTIUM AND BLUE SKY, AIED 2023 | 2023年 / 1831卷
关键词
Data augmentation; Generative adversarial networks; Learning analytics; Virtual learning environment; At-risk students;
D O I
10.1007/978-3-031-36336-8_40
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we explore the use of data augmentation through generative adversarial networks (GANs) for improving the performance of machine learning models in detecting at-risk students in the context of e-learning institutions. It is well known that balancing datasets can have a positive effect on improving the performance of machine learning models, especially for deep neural networks. However, undersampling can potentially result in the loss of valuable data, so data augmentation seems to be more meaningful solution when the dataset is relatively small. One of the most popular data augmentation approaches is the use of GAN networks due to their ability to generate high-quality synthetic samples that belong to the distribution of the original dataset. On the other hand, detecting at-risk students is a hot topic in learning analytics, and ability to detect these students early with high accuracy enables e-learning institutions to take necessary steps to motivate and retain students during the course. We apply this approach to the OULA dataset, a commonly used dataset in learning analytics that includes labeled at-risk students. The OULA dataset is not highly-imbalanced, making it more challenging to improve model performance through these techniques.
引用
收藏
页码:260 / 265
页数:6
相关论文
共 50 条
  • [1] Analysing University at-Risk Students in a Virtual Learning Environment using Machine Learning Algorithms
    Naidoo, Deshalin
    Adeliyi, Timothy T.
    2023 CONFERENCE ON INFORMATION COMMUNICATIONS TECHNOLOGY AND SOCIETY, ICTAS, 2023, : 113 - 119
  • [2] Predicting at-risk university students in a virtual learning environment via a machine learning algorithm
    Chui, Kwok Tai
    Fung, Dennis Chun Lok
    Lytras, Miltiadis D.
    Lam, Tin Miu
    COMPUTERS IN HUMAN BEHAVIOR, 2020, 107 (107)
  • [3] Fairness in Learning Analytics: Student At-risk Prediction in Virtual Learning Environments
    Riazy, Shirin
    Simbeck, Katharina
    Schreck, Vanessa
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED EDUCATION (CSEDU), VOL 1, 2020, : 15 - 25
  • [4] Virtual Big Data for GAN Based Data Augmentation
    Mansourifar, Hadi
    Chen, Lin
    Shi, Weidong
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 1478 - 1487
  • [5] Detection of at-risk students with Learning Analytics Techniques
    Saiz Manzanares, Maria Consuelo
    Marticorena Sanchez, Raul
    Arnaiz Gonzalez, Alvar
    Escolar Llamazares, Maria Del Camino
    Queiruga Dios, Miguel Angel
    EUROPEAN JOURNAL OF INVESTIGATION IN HEALTH PSYCHOLOGY AND EDUCATION, 2018, 8 (03) : 129 - 142
  • [6] EPARS: Early Prediction of At-Risk Students with Online and Offline Learning Behaviors
    Yang, Yu
    Wen, Zhiyuan
    Cao, Jiannong
    Shen, Jiaxing
    Yin, Hongzhi
    Zhou, Xiaofang
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2020), PT II, 2020, 12113 : 3 - 19
  • [7] Data Mining Approach to the Identification of At-Risk Students
    Ho, Li Chin
    Shim, Kyong Jin
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 5333 - 5335
  • [8] Students' Academic Performance and Engagement Prediction in a Virtual Learning Environment Using Random Forest with Data Balancing
    Jawad, Khurram
    Shah, Muhammad Arif
    Tahir, Muhammad
    SUSTAINABILITY, 2022, 14 (22)
  • [9] Augmentation techniques for sequential clinical data to improve Deep Learning prediction techniques
    Florez, Alexander Y. C.
    Scabora, Lucas
    Amer-Yahia, Sihem
    Rodrigues-Jr, Jose F.
    2020 IEEE 33RD INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS(CBMS 2020), 2020, : 597 - 602
  • [10] Learning Analytics at Low Cost: At-risk Student Prediction with Clicker Data and Systematic Proactive Interventions
    Choi, Samuel P. M.
    Lam, S. S.
    Li, Kam Cheong
    Wong, Billy T. M.
    EDUCATIONAL TECHNOLOGY & SOCIETY, 2018, 21 (02): : 273 - 290