An adaptive XGBoost-based optimized sliding window for concept drift handling in non-stationary spatiotemporal data streams classifications

被引:3
|
作者
Angbera, Ature [1 ,2 ]
Chan, Huah Yong [1 ]
机构
[1] Univ Sains Malaysia, Sch Comp Sci, Minden 11800, Pulau Pinang, Malaysia
[2] Joseph Sarwuan Tarka Univ, Dept Comp Sci, Makurdi, Nigeria
来源
JOURNAL OF SUPERCOMPUTING | 2024年 / 80卷 / 06期
关键词
Concept drift; Machine learning; Sliding windows; Spatiotemporal data streams; Bayesian optimization;
D O I
10.1007/s11227-023-05729-8
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, the popularity of using data science for decision-making has grown significantly. This rise in popularity has led to a significant learning challenge known as concept drifting, primarily due to the increasing use of spatial and temporal data streaming applications. Concept drift can have highly negative consequences, leading to the degradation of models used in these applications. A new model called BOASWIN-XGBoost (Bayesian Optimized Adaptive Sliding Window and XGBoost) has been introduced in this work to handle concept drift. This model is designed explicitly for classifying streaming data and comprises three main procedures: pre-processing, concept drift detection, and classification. The BOASWIN-XGBoost model utilizes a method called Bayesian-Optimized Adaptive Sliding Window (BOASWIN) to identify the presence of concept drift in the streaming data. Additionally, it employs an optimized XGBoost (eXtreme Gradient Boosting) model for classification purposes. The hyperparameter tuning approach known as BO-TPE (Bayesian Optimization with Tree-structured Parzen Estimator) is employed to fine-tune the XGBoost model's parameters, thus enhancing the classifier's performance. Seven streaming datasets were used to evaluate the proposed approach's performance, including Agrawal_a, Agrawal_g, SEA_a, SEA_g, Hyperplane, Phishing, and Weather. The simulation results demonstrate that the suggested model achieves impressive accuracy values of 70.83%, 71.02%, 76.76%, 76.96%, 84.26%, 95.53%, and 78.35% on the corresponding datasets, affirming its superior performance in handling concept drift and classifying streaming data.
引用
收藏
页码:7781 / 7811
页数:31
相关论文
共 43 条
  • [21] Adaptive windowing based recurrent neural network for drift adaption in non-stationary environment
    Suryawanshi S.
    Goswami A.
    Patil P.
    Mishra V.
    Journal of Ambient Intelligence and Humanized Computing, 2023, 14 (10) : 14125 - 14139
  • [22] Recognition of non-stationary signal instantaneous frequency with LMSSGST based on sliding window width optimization
    Liu, Jingliang
    Su, Jielong
    Dai, Yichen
    Li, Yuzu
    Huang, Yong
    Zheng, Wenting
    Zhendong yu Chongji/Journal of Vibration and Shock, 2024, 43 (19): : 183 - 193
  • [23] Adaptive correlation analysis of non-stationary random processes based on a movable window of analysis
    Pogribnoi, VA
    Rozhankovskii, IV
    Gren, YV
    IZVESTIYA VYSSHIKH UCHEBNYKH ZAVEDENII RADIOELEKTRONIKA, 2002, 45 (5-6): : 29 - 34
  • [24] Action recognition using optimized deep autoencoder and CNN for surveillance data streams of non-stationary environments
    Ullah, Amin
    Muhammad, Khan
    Ul Haq, Ijaz
    Baik, Sung Wook
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 96 : 386 - 397
  • [25] Incremental Learning in Non-stationary Environments with Concept Drift using a Multiple Classifier Based Approach
    Karnick, Matthew
    Muhlbaier, Michael D.
    Polikar, Robi
    19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 497 - 500
  • [26] Sparse representation-based correlation analysis of non-stationary spatiotemporal big data
    Song, Weijing
    Liu, Peng
    Wang, Lizhe
    INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2016, 9 (09) : 892 - 913
  • [27] Recovery analysis for adaptive learning from non-stationary data streams: Experimental design and case study
    Shaker, Ammar
    Huellermeier, Eyke
    NEUROCOMPUTING, 2015, 150 : 250 - 264
  • [28] An Efficient Sliding Window Based Algorithm for Adaptive Frequent Itemset Mining over Data Streams
    Deypir, Mhmood
    Sadreddini, Mohammad Hadi
    Taahomi, Mehran
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2013, 29 (05) : 1001 - 1020
  • [29] Anomaly detection method for sensor network data streams based on sliding window sampling and optimized clustering
    Lin, Ling
    Su, Jinshan
    SAFETY SCIENCE, 2019, 118 : 70 - 75
  • [30] Adaptive Chunk-Based Dynamic Weighted Majority for Imbalanced Data Streams With Concept Drift
    Lu, Yang
    Cheung, Yiu-Ming
    Yan Tang, Yuan
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (08) : 2764 - 2778