Balancing sequential data to predict students at-risk using adversarial networks

被引:12
|
作者
Waheed, Hajra [1 ]
Anas, Muhammad [1 ]
Hassan, Saeed-Ul [1 ]
Aljohani, Naif Radi [2 ]
Alelyani, Salem [3 ,4 ]
Edifor, Ernest Edem [5 ]
Nawaz, Raheel [5 ]
机构
[1] Informat Technol Univ, 346-B Ferozepur Rd, Lahore, Pakistan
[2] King Abdulaziz Univ, Fac Comp & Informat Technol, Jeddah, Saudi Arabia
[3] King Khalid Univ, Ctr Artificial Intelligence CAI, POB 9004, Abha 61413, Saudi Arabia
[4] King Khalid Univ, Coll Comp Sci, POB 9004, Abha 61413, Saudi Arabia
[5] Manchester Metropolitan Univ, Operat Technol Events & Hosp Management Business, Manchester M15 6BH, Lancs, England
关键词
Students At-Risk; CGAN; Class Imbalance; Sequential Data; Time-Series; Sythetic Minority Oversampling technique; PERFORMANCE; SMOTE;
D O I
10.1016/j.compeleceng.2021.107274
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Class imbalance is a challenging problem especially in a supervised learning setup, as most classification algorithms are designed for balanced class distributions. Although various up-sampling approaches exist for eliminating the class imbalance, however, they do not handle the complexities of sequential data. In this study, using the data of over 30,000 students from the Open University (UK), we implement a deep-learning-based approach using adversarial networks, Sequential Conditional Generative Adversarial Network (SC-GAN) that encapsulates the past behavior of each student for its previous sequences and generates synthetic student records for the next timestamp. The proposed approach is devised to generate instances, which are augmented with the actual data to eliminate class imbalance. A performance comparison of the proposed SC-GAN with the standard up-sampling methods is also presented and the results validate the proposed method with an improved AUC of 7.07% and 6.53%, respectively, when compared with conventional Random Over-sampling and Sythetic Minority Oversampling techniques.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Imbalanced Fault Diagnosis of Rolling Bearing Using Data Synthesis Based on Multi-Resolution Fusion Generative Adversarial Networks
    Hao, Chuanzhu
    Du, Junrong
    Liang, Haoran
    MACHINES, 2022, 10 (05)
  • [42] Leveraging sequential information from multivariate behavioral sensor data to predict the moment of calving in dairy cattle using deep learning
    Liseune, Arno
    Van den Poel, Dirk
    Hut, Peter R.
    van Eerdenburg, Frank J. C. M.
    Hostens, Miel
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2021, 191
  • [43] Classification of imbalanced data using machine learning algorithms to predict the risk of renal graft failures in Ethiopia
    Mulugeta, Getahun
    Zewotir, Temesgen
    Tegegne, Awoke Seyoum
    Juhar, Leja Hamza
    Muleta, Mahteme Bekele
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2023, 23 (01)
  • [44] Using one-dimensional convolutional neural networks and data augmentation to predict thermal production in geothermal fields
    Yang, Yunxing
    Zhang, Yanjun
    Cheng, Yuxiang
    Lei, Zhihong
    Gao, Xuefeng
    Huang, Yibin
    Ma, Yueqiang
    JOURNAL OF CLEANER PRODUCTION, 2023, 387
  • [45] Quality enhancement at higher education institutions by early identifying students at risk using data mining
    Mahboob, Khalid
    Asif, Raheela
    Haider, Najmi Ghani
    MEHRAN UNIVERSITY RESEARCH JOURNAL OF ENGINEERING AND TECHNOLOGY, 2023, 42 (01) : 120 - 136
  • [46] Novel extreme regression-voting classifier to predict death risk in vaccinated people using VAERS data
    Saad, Eysha
    Sadiq, Saima
    Jamil, Ramish
    Rustam, Furqan
    Mehmood, Arif
    Choi, Gyu Sang
    Ashraf, Imran
    PLOS ONE, 2022, 17 (06):
  • [47] Struggling with strugglers: using data from selection tools for early identification of medical students at risk of failure
    Li, James
    Thompson, Rachel
    Shulruf, Boaz
    BMC MEDICAL EDUCATION, 2019, 19 (01)
  • [48] Data-driven risk analysis of nonlinear factor interactions in road safety using Bayesian networks
    Carrodano, Cinzia
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [49] Using Speech Data From Interactions With a Voice Assistant to Predict the Risk of Future Accidents for Older Drivers: Prospective Cohort Study
    Yamada, Yasunori
    Shinkawa, Kaoru
    Kobayashi, Masatomo
    Takagi, Hironobu
    Nemoto, Miyuki
    Nemoto, Kiyotaka
    Arai, Tetsuaki
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2021, 23 (04)