An adaptive XGBoost-based optimized sliding window for concept drift handling in non-stationary spatiotemporal data streams classifications

被引:3
|
作者
Angbera, Ature [1 ,2 ]
Chan, Huah Yong [1 ]
机构
[1] Univ Sains Malaysia, Sch Comp Sci, Minden 11800, Pulau Pinang, Malaysia
[2] Joseph Sarwuan Tarka Univ, Dept Comp Sci, Makurdi, Nigeria
来源
JOURNAL OF SUPERCOMPUTING | 2024年 / 80卷 / 06期
关键词
Concept drift; Machine learning; Sliding windows; Spatiotemporal data streams; Bayesian optimization;
D O I
10.1007/s11227-023-05729-8
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, the popularity of using data science for decision-making has grown significantly. This rise in popularity has led to a significant learning challenge known as concept drifting, primarily due to the increasing use of spatial and temporal data streaming applications. Concept drift can have highly negative consequences, leading to the degradation of models used in these applications. A new model called BOASWIN-XGBoost (Bayesian Optimized Adaptive Sliding Window and XGBoost) has been introduced in this work to handle concept drift. This model is designed explicitly for classifying streaming data and comprises three main procedures: pre-processing, concept drift detection, and classification. The BOASWIN-XGBoost model utilizes a method called Bayesian-Optimized Adaptive Sliding Window (BOASWIN) to identify the presence of concept drift in the streaming data. Additionally, it employs an optimized XGBoost (eXtreme Gradient Boosting) model for classification purposes. The hyperparameter tuning approach known as BO-TPE (Bayesian Optimization with Tree-structured Parzen Estimator) is employed to fine-tune the XGBoost model's parameters, thus enhancing the classifier's performance. Seven streaming datasets were used to evaluate the proposed approach's performance, including Agrawal_a, Agrawal_g, SEA_a, SEA_g, Hyperplane, Phishing, and Weather. The simulation results demonstrate that the suggested model achieves impressive accuracy values of 70.83%, 71.02%, 76.76%, 76.96%, 84.26%, 95.53%, and 78.35% on the corresponding datasets, affirming its superior performance in handling concept drift and classifying streaming data.
引用
收藏
页码:7781 / 7811
页数:31
相关论文
共 43 条
  • [1] An adaptive XGBoost-based optimized sliding window for concept drift handling in non-stationary spatiotemporal data streams classifications
    Ature Angbera
    Huah Yong Chan
    The Journal of Supercomputing, 2024, 80 : 7781 - 7811
  • [2] Detecting and Tracking Concept Class Drift and Emergence in Non-Stationary Fast Data Streams
    Parker, Brandon S.
    Khan, Latifur
    PROCEEDINGS OF THE TWENTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2015, : 2908 - 2913
  • [3] Adaptive Ensemble Based Learning in Non-stationary Environments with Variable Concept Drift
    Susnjak, Teo
    Barczak, Andre L. C.
    Hawick, Ken A.
    NEURAL INFORMATION PROCESSING: THEORY AND ALGORITHMS, PT I, 2010, 6443 : 438 - 445
  • [4] Handling Concept Drift in Non-stationary Bandit Through Predicting Future Rewards
    Tsai, Yun-Da
    Lin, Shou-De
    TRENDS AND APPLICATIONS IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2024 WORKSHOPS, RAFDA AND IWTA, 2024, 14658 : 161 - 173
  • [5] Adaptive Drift Detection Mechanism for Non-Stationary Data Stream
    Nagendhiran, Nalini
    Kuppusamy, Lakshmanan
    JOURNAL OF INFORMATION & KNOWLEDGE MANAGEMENT, 2021, 20 (01)
  • [6] A Variable Sliding Window Algorithm Based on Concept Drift for Frequent Pattern Mining Over Data Streams*
    Yin, Yue
    Li, Peng
    Chen, Jing
    2022 IEEE 28TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS, ICPADS, 2022, : 818 - 825
  • [7] Recovery Analysis for Adaptive Learning from Non-stationary Data Streams
    Shaker, Ammar
    Huellermeier, Eyke
    PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON COMPUTER RECOGNITION SYSTEMS CORES 2013, 2013, 226 : 289 - 298
  • [8] An online adaptive classifier ensemble for mining non-stationary data streams
    Verdecia-Cabrera, Alberto
    Blanco, Isvani Frias
    Carvalho, Andre C. P. L. F.
    INTELLIGENT DATA ANALYSIS, 2018, 22 (04) : 787 - 806
  • [9] BASWE: Balanced Accuracy-Based Sliding Window Ensemble for Classification in Imbalanced Data Streams with Concept Drift
    de Oliveira, Douglas Amorim
    Delgado, Karina Valdivia
    Lauretto, Marcelo de Souza
    INTELLIGENT SYSTEMS, BRACIS 2024, PT I, 2025, 15412 : 231 - 246
  • [10] Utilizing an Ensemble Machine Learning Framework for Handling Concept Drift in Spatiotemporal Data Streams Classification
    Angbera, Ature
    Chan, Huah Yong
    Informatica (Slovenia), 2024, 48 (02): : 213 - 222