Big data classification of learning behaviour based on data reduction and ensemble learning

被引:1
|
作者
Wang, Taotao [1 ]
Wu, Xiaoxuan [2 ]
机构
[1] Jiangxi Univ Technol, Dept Informat Engn Coll, Ganzhou 330098, Jiangxi, Peoples R China
[2] Guangxi Vocat Coll Water Resources & Elect Power, Dept Gen Educ, Nanning 530023, Peoples R China
关键词
data reduction; ensemble learning; rough set theory; big data of learning behaviour; big data classification;
D O I
10.1504/IJCEELL.2023.132418
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
In order to overcome the problems of low classification accuracy, long time, and high missing ratio of traditional methods, a big data classification method of learning behaviour based on data reduction and ensemble learning was proposed. By cleaning and transforming the big data of learning behaviour and discretising the attributes of big data of learning behaviour, the data reduction algorithm is used to simplify the attributes of big data of learning behaviour. The ensemble learning method is used to linearly combine several weak classifiers, and the ensemble classifier is trained according to Choquet integral. The trained classifier is used to classify the big data of learning behaviour after simplified processing. The experimental results show that when the amount of big data on learning behaviour reaches 5,000 GB, the average classification accuracy of the proposed method is 92%, the classification time is 29 s, and the failure rate of classification is 0.32%.
引用
收藏
页码:496 / 510
页数:16
相关论文
共 50 条
  • [1] Intrusion detection based on ensemble learning for big data classification
    Jemili, Farah
    Meddeb, Rahma
    Korbaa, Ouajdi
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (03): : 3771 - 3798
  • [2] Imbalanced Data Classification Method Based on Ensemble Learning
    Xiang, Yu
    Xie, Yongping
    COMMUNICATIONS, SIGNAL PROCESSING, AND SYSTEMS, CSPS 2018, VOL III: SYSTEMS, 2020, 517 : 18 - 24
  • [3] Towards Big Data Bayesian Network Learning - an Ensemble Learning Based Approach
    Tang, Yan
    Wang, Yu
    Li, Ling
    Cooper, Kendra M. L.
    2014 IEEE INTERNATIONAL CONGRESS ON BIG DATA (BIGDATA CONGRESS), 2014, : 355 - 357
  • [4] A dynamic ensemble learning based data mining framework for medical imbalanced big data
    Rithani, M.
    Kumar, R. Prasanna
    Ali, Altalbe
    KNOWLEDGE-BASED SYSTEMS, 2025, 310
  • [5] Spark-based ensemble learning for imbalanced data classification
    Ding J.
    Wang S.
    Jia L.
    You J.
    Jiang Y.
    International Journal of Performability Engineering, 2018, 14 (05) : 945 - 964
  • [6] Unbalanced data sentiment classification method based on ensemble learning
    Duan, Jidong
    Ma, Kun
    Sun, Runyuan
    PROCEEDINGS OF 2019 2ND INTERNATIONAL CONFERENCE ON BIG DATA TECHNOLOGIES (ICBDT 2019), 2019, : 34 - 38
  • [7] A Survey on Ensemble Learning for Data Stream Classification
    Gomes, Heitor Murilo
    Barddal, Jean Paul
    Enembreck, Fabricio
    Bifet, Albert
    ACM COMPUTING SURVEYS, 2017, 50 (02)
  • [8] An Improved Ensemble Learning for Imbalanced Data Classification
    Yuan, Zhengwu
    Zhao, Pu
    PROCEEDINGS OF 2019 IEEE 8TH JOINT INTERNATIONAL INFORMATION TECHNOLOGY AND ARTIFICIAL INTELLIGENCE CONFERENCE (ITAIC 2019), 2019, : 408 - 411
  • [9] An Algorithm Design of Big Data Anomaly Detection Based on Ensemble Learning
    Chen, Xiao
    PROCEEDINGS OF 2024 INTERNATIONAL CONFERENCE ON COMPUTER AND MULTIMEDIA TECHNOLOGY, ICCMT 2024, 2024, : 319 - 323
  • [10] Empirical Analysis of Asymptotic Ensemble Learning for Big Data
    Salloum, Salman
    Huang, Joshua Zhexue
    He, Yulin
    2016 3RD IEEE/ACM INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING, APPLICATIONS AND TECHNOLOGIES (BDCAT), 2016, : 8 - 17