Balancing sequential data to predict students at-risk using adversarial networks

被引:12
|
作者
Waheed, Hajra [1 ]
Anas, Muhammad [1 ]
Hassan, Saeed-Ul [1 ]
Aljohani, Naif Radi [2 ]
Alelyani, Salem [3 ,4 ]
Edifor, Ernest Edem [5 ]
Nawaz, Raheel [5 ]
机构
[1] Informat Technol Univ, 346-B Ferozepur Rd, Lahore, Pakistan
[2] King Abdulaziz Univ, Fac Comp & Informat Technol, Jeddah, Saudi Arabia
[3] King Khalid Univ, Ctr Artificial Intelligence CAI, POB 9004, Abha 61413, Saudi Arabia
[4] King Khalid Univ, Coll Comp Sci, POB 9004, Abha 61413, Saudi Arabia
[5] Manchester Metropolitan Univ, Operat Technol Events & Hosp Management Business, Manchester M15 6BH, Lancs, England
关键词
Students At-Risk; CGAN; Class Imbalance; Sequential Data; Time-Series; Sythetic Minority Oversampling technique; PERFORMANCE; SMOTE;
D O I
10.1016/j.compeleceng.2021.107274
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Class imbalance is a challenging problem especially in a supervised learning setup, as most classification algorithms are designed for balanced class distributions. Although various up-sampling approaches exist for eliminating the class imbalance, however, they do not handle the complexities of sequential data. In this study, using the data of over 30,000 students from the Open University (UK), we implement a deep-learning-based approach using adversarial networks, Sequential Conditional Generative Adversarial Network (SC-GAN) that encapsulates the past behavior of each student for its previous sequences and generates synthetic student records for the next timestamp. The proposed approach is devised to generate instances, which are augmented with the actual data to eliminate class imbalance. A performance comparison of the proposed SC-GAN with the standard up-sampling methods is also presented and the results validate the proposed method with an improved AUC of 7.07% and 6.53%, respectively, when compared with conventional Random Over-sampling and Sythetic Minority Oversampling techniques.
引用
收藏
页数:10
相关论文
共 50 条