Big data classification of learning behaviour based on data reduction and ensemble learning

被引:1
作者
Wang, Taotao [1 ]
Wu, Xiaoxuan [2 ]
机构
[1] Jiangxi Univ Technol, Dept Informat Engn Coll, Ganzhou 330098, Jiangxi, Peoples R China
[2] Guangxi Vocat Coll Water Resources & Elect Power, Dept Gen Educ, Nanning 530023, Peoples R China
关键词
data reduction; ensemble learning; rough set theory; big data of learning behaviour; big data classification;
D O I
10.1504/IJCEELL.2023.132418
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
In order to overcome the problems of low classification accuracy, long time, and high missing ratio of traditional methods, a big data classification method of learning behaviour based on data reduction and ensemble learning was proposed. By cleaning and transforming the big data of learning behaviour and discretising the attributes of big data of learning behaviour, the data reduction algorithm is used to simplify the attributes of big data of learning behaviour. The ensemble learning method is used to linearly combine several weak classifiers, and the ensemble classifier is trained according to Choquet integral. The trained classifier is used to classify the big data of learning behaviour after simplified processing. The experimental results show that when the amount of big data on learning behaviour reaches 5,000 GB, the average classification accuracy of the proposed method is 92%, the classification time is 29 s, and the failure rate of classification is 0.32%.
引用
收藏
页码:496 / 510
页数:16
相关论文
共 50 条
[41]   A Classifier Ensemble Framework for Multimedia Big Data Classification [J].
Yan, Yilin ;
Zhu, Qiusha ;
Shyu, Mei-Ling ;
Chen, Shu-Ching .
PROCEEDINGS OF 2016 IEEE 17TH INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION (IEEE IRI), 2016, :615-622
[42]   Genetic Programming with Interval Functions and Ensemble Learning for Classification with Incomplete Data [J].
Cao Truong Tran ;
Zhang, Mengjie ;
Xue, Bing ;
Andreae, Peter .
AI 2018: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, 11320 :577-589
[43]   ElStream: An Ensemble Learning Approach for Concept Drift Detection in Dynamic Social Big Data Stream Learning [J].
Abbasi, Ahmad ;
Javed, Abdul Rehman ;
Chakraborty, Chinmay ;
Nebhen, Jamel ;
Zehra, Wisha ;
Jalil, Zunera .
IEEE ACCESS, 2021, 9 :66408-66419
[44]   A Classifier Using Online Bagging Ensemble Method for Big Data Stream Learning [J].
Yanxia Lv ;
Sancheng Peng ;
Ying Yuan ;
Cong Wang ;
Pengfei Yin ;
Jiemin Liu ;
Cuirong Wang .
Tsinghua Science and Technology, 2019, (04) :379-388
[45]   RSPCA: Random Sample Partition and Clustering Approximation for ensemble learning of big data [J].
Mahmud, Mohammad Sultan ;
Zheng, Hua ;
Garcia-Gil, Diego ;
Garcia, Salvador ;
Huang, Joshua Zhexue .
PATTERN RECOGNITION, 2025, 161
[46]   A Classifier Using Online Bagging Ensemble Method for Big Data Stream Learning [J].
Lv, Yanxia ;
Peng, Sancheng ;
Yuan, Ying ;
Wang, Cong ;
Yin, Pengfei ;
Liu, Jiemin ;
Wang, Cuirong .
TSINGHUA SCIENCE AND TECHNOLOGY, 2019, 24 (04) :379-388
[47]   EMRIL: Ensemble Method based on ReInforcement Learning for binary classification in imbalanced drifting data streams [J].
Usman, Muhammad ;
Chen, Huanhuan .
NEUROCOMPUTING, 2024, 605
[48]   Improving Colorectal Polyp Classification Based on Physical Examination Data-An Ensemble Learning Approach [J].
Xie, Xiaolei ;
Xing, Jie ;
Kong, Nan ;
Li, Chong ;
Li, Jinlin ;
Zhang, Shutian .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2018, 3 (01) :434-441
[49]   Ensemble classifier based big data classification with hybrid optimal feature selection [J].
Pamila, J. C. Miraclin Joyce ;
Selvi, R. Senthamil ;
Santhi, P. ;
Nithya, T. M. .
ADVANCES IN ENGINEERING SOFTWARE, 2022, 173
[50]   DISTRIBUTED LEARNING ALGORITHM BASED ON DATA REDUCTION [J].
Czarnowski, Ireneusz ;
Jedrzejowicz, Piotr .
ICAART 2009: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, 2009, :198-+