Concept Drift Adaptive Prediction Method Based on Dynamic Sample Selection

被引:0
|
作者
Dai, Jin [1 ,3 ]
Li, Hao [2 ,3 ]
Wang, Guo-Yin [3 ]
机构
[1] School of Software, Chongqing University of Posts and Telecommunications, Chongqing
[2] School of Computer Science, Chongqing University of Posts and Telecommunications, Chongqing
[3] Chongqing Key Laboratory of Computational Intelligence, Chongqing
来源
Tien Tzu Hsueh Pao/Acta Electronica Sinica | 2024年 / 52卷 / 09期
基金
中国国家自然科学基金;
关键词
adaptive forecast; concept drift; local drift detection; sample denoisy; sample selection; stream data;
D O I
10.12263/DZXB.20230124
中图分类号
学科分类号
摘要
Concept drift is an important performance factor in stream data mining, mainly handled by incremental updating or retraining models, but not fully utilizing existing knowledge. This paper proposed an concept drift adaptive prediction method based on dynamic sample selection, starting from the comprehensive use of all samples. The method performs local consistency based drift detection when new samples arrive, removes noisy samples in the region when drift is detected, and reuses historically similar concepts when new concepts are detected. Finally, multi-representative point summarization is performed for different categories of samples in the region, and the prediction model is updated simultaneously. In this paper, the denoising effect is verified on synthetic datasets containing different drift types, and the prediction task is performed on the real dataset. The experimental results show that the method can effectively remove the drift noise due to conceptual drift, which effectively improves the performance of the prediction model. The prediction outperforms the popular concept drift adaptive model. © 2024 Chinese Institute of Electronics. All rights reserved.
引用
收藏
页码:3228 / 3239
页数:11
相关论文
共 27 条
  • [1] MOHAWESH R, TRAN S, OLLINGTON R, Et al., Analysis of concept drift in fake reviews detection, Expert Systems with Applications, 169, (2021)
  • [2] WANG L, WU C., Dynamic imbalanced business credit evaluation based on Learn++ with sliding time window and weight sampling and FCM with multiple kernels, Information Sciences, 520, pp. 305-323, (2020)
  • [3] HENKE M, SANTOS E, SOUTO E, Et al., Spam detection based on feature evolution to deal withConcept drift, Journal of Universal Computer Science, 27, 4, pp. 364-386, (2021)
  • [4] YANG L M, GUO W B, HAO Q Y, Et al., CADE: Detecting and explaining concept drift samples for security applications, 30th USENIX Security Symposium, pp. 2327-2344, (2021)
  • [5] HAN G J, ZHAO T F, LIU L, Et al., Concept drift detection of industrial data flow based on multivariate region set partition, Acta Electronica Sinica, 51, 7, pp. 1906-1916, (2023)
  • [6] LU K Z, CHEN C F, CAI H, Et al., Online classification algorithm for concept drift and class imbalance data stream, Acta Electronica Sinica, 50, 3, pp. 585-597, (2022)
  • [7] GAMA J, MEDAS P, CASTILLO G, Et al., Learning with drift detection, Advances in Artificial Intelligence-SBIA 2004, pp. 286-295, (2004)
  • [8] FRIAS-BLANCO I, DEL CAMPO-AVILA J, RAMOS-JIMENEZ G, Et al., Online and non-parametric drift detection methods based on hoeffding's bounds, IEEE Transactions on Knowledge and Data Engineering, 27, 3, pp. 810-823, (2015)
  • [9] LIU A J, LU J, ZHANG G Q., Concept drift detection via equal intensity k-means space partitioning, IEEE Transactions on Cybernetics, 51, 6, pp. 3198-3211, (2021)
  • [10] YANG Z, AL-DAHIDI S, BARALDI P, Et al., A novel concept drift detection method for incremental learning in nonstationary environments, IEEE Transactions on Neural Networks and Learning Systems, 31, 1, pp. 309-320, (2020)