Effectiveness of data augmentation to predict students at risk using deep learning algorithms

被引:4
作者
Fahd, Kiran [1 ]
Miah, Shah J. [1 ]
机构
[1] Univ Newcastle, Newcastle Business Sch, Newcastle City Campus, Newcastle, NSW, Australia
关键词
Deep learning; Data augmentation; Multilayer perceptron (MLP); Deep forest (DF); SMOTE; Distribution-based algorithm; HIGHER-EDUCATION; PERFORMANCE; MANAGEMENT; ANALYTICS; DESIGN; SMOTE;
D O I
10.1007/s13278-023-01117-5
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The academic intervention to predict at-risk higher education (HE) students requires effective data model development. Such data modelling projects in the HE context may have common issues related to (a) adopting small-scale modelling that gives limited options for early intervention and (b) using imbalanced data that hinders capturing effective details of poorly performing students. We address the issues going beyond the distribution-based algorithm, using a multilayer perceptron classifier which shows better on confusion metric, recall, and precision measures for identifying at-risk students. Our proposed deep learning-based model, which uses data augmentation techniques to supplement the data instances and balance the dataset, aims to improve the prediction accuracy of whether the student will fail or not based on their interaction with the learning management systems to prevent struggling students from evasion.
引用
收藏
页数:16
相关论文
共 62 条
[1]   Spam SMS filtering based on text features and supervised machine learning techniques [J].
Abid, Muhammad Adeel ;
Ullah, Saleem ;
Siddique, Muhammad Abubakar ;
Mushtaq, Muhammad Faheem ;
Aljedaani, Wajdi ;
Rustam, Furqan .
MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (28) :39853-39871
[2]  
Ajoodha R, 2020, FORECASTING LEARNER
[3]  
Akour M., 2020, Indonesian J. Electr. Eng. Comput. Sci, V19, P387, DOI [10.11591/ijeecs.v19.i1.pp388-394, DOI 10.11591/IJEECS.V19.I1.PP388-394]
[4]   Educational data mining and learning analytics for 21st century higher education: A review and synthesis [J].
Aldowah, Hanan ;
Al-Samarraie, Hosam ;
Fauzy, Wan Mohamad .
TELEMATICS AND INFORMATICS, 2019, 37 :13-49
[5]  
Allah AGF., 2020, J THEOR APPL INF TEC, V8, P3778
[6]  
Barari S., 2019, DEEP LEARNING PYTHON
[7]  
Beer C, 2017, J FURTH HIGH EDUC, V41, P773, DOI 10.1080/0309877X.2016.1177171
[8]  
Berens Johannes, 2019, P ED DAT MIN C, V11, P1, DOI [DOI 10.5281/ZENODO.3594771, 10.5281/ZENODO.3594771]
[9]   Improving the performance of Naive Bayes multinomial in e-mail foldering by introducing distribution-based balance of datasets [J].
Bermejo, Pablo ;
Gamez, Jose A. ;
Puerta, Jose M. .
EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (03) :2072-2080
[10]  
Canty A., 2020, Learning from Tasmania, V3, P1, DOI DOI 10.37074/JALT.2020.3.S1.3