Big data classification of learning behaviour based on data reduction and ensemble learning

Cited by: 1
Authors
Wang, Taotao [1]
Wu, Xiaoxuan [2]
Affiliations
[1] Jiangxi Univ Technol, Dept Informat Engn Coll, Ganzhou 330098, Jiangxi, Peoples R China
[2] Guangxi Vocat Coll Water Resources & Elect Power, Dept Gen Educ, Nanning 530023, Peoples R China
Keywords
data reduction; ensemble learning; rough set theory; big data of learning behaviour; big data classification
DOI
10.1504/IJCEELL.2023.132418
Chinese Library Classification number
G40 [Education]
Discipline classification code
040101; 120403
Abstract
To address the low classification accuracy, long classification time, and high missing ratio of traditional methods, a big data classification method for learning behaviour based on data reduction and ensemble learning is proposed. The learning behaviour data are first cleaned and transformed and their attributes discretised; a data reduction algorithm then reduces the attribute set. The ensemble learning method linearly combines several weak classifiers, and the resulting ensemble classifier is trained according to the Choquet integral. The trained classifier is then used to classify the reduced learning behaviour data. Experimental results show that when the volume of learning behaviour data reaches 5,000 GB, the proposed method achieves an average classification accuracy of 92%, a classification time of 29 s, and a classification failure rate of 0.32%.
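The abstract does not say how the Choquet integral enters the ensemble, so the following is only a minimal sketch of one common reading: each weak classifier emits a per-class score, and the scores are fused with a discrete Choquet integral over a fuzzy measure. The classifier names (`tree`, `knn`, `bayes`), the class labels, the weights, and the helper names are all hypothetical and not taken from the paper. With an additive measure the integral reduces to a plain weighted sum, which matches the "linear combination" of weak classifiers mentioned above; a non-additive measure lets groups of classifiers count for more or less than the sum of their weights.

```python
from itertools import combinations


def additive_measure(weights):
    """Build an additive fuzzy measure: mu(A) = sum of the weights in A."""
    names = list(weights)
    measure = {}
    for size in range(len(names) + 1):
        for subset in combinations(names, size):
            measure[frozenset(subset)] = sum(weights[n] for n in subset)
    return measure


def choquet_integral(scores, measure):
    """Discrete Choquet integral of `scores` with respect to `measure`.

    scores:  dict mapping classifier name -> score
    measure: dict mapping frozenset of names -> measure value
             (assumed monotone, 0 on the empty set, 1 on the full set)
    """
    ordered = sorted(scores, key=scores.get)  # ascending by score
    total, previous = 0.0, 0.0
    for i, name in enumerate(ordered):
        # Coalition of all classifiers scoring at least as high as `name`.
        coalition = frozenset(ordered[i:])
        total += (scores[name] - previous) * measure[coalition]
        previous = scores[name]
    return total


def fuse_and_classify(class_scores, measure):
    """Fuse per-class classifier scores and return (best label, fused scores)."""
    fused = {label: choquet_integral(s, measure)
             for label, s in class_scores.items()}
    return max(fused, key=fused.get), fused


# Illustrative values only (assumptions, not results from the paper).
weights = {"tree": 0.5, "knn": 0.3, "bayes": 0.2}
linear = additive_measure(weights)

# Non-additive variant: tree and knn together count for more than 0.5 + 0.3.
interacting = dict(linear)
interacting[frozenset({"tree", "knn"})] = 0.9

class_scores = {
    "active": {"tree": 0.9, "knn": 0.6, "bayes": 0.2},
    "passive": {"tree": 0.1, "knn": 0.4, "bayes": 0.8},
}
label, fused = fuse_and_classify(class_scores, interacting)
```

In a real pipeline the fuzzy measure would be learned from training data rather than set by hand; the hand-written values here only show how the fusion step behaves.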
Pages: 496-510
Page count: 16