Probabilistic exact adaptive random forest for recurrent concepts in data streams

被引:5
|
作者
Wu, Ocean [1 ]
Koh, Yun Sing [1 ]
Dobbie, Gillian [1 ]
Lacombe, Thomas [1 ]
机构
[1] Univ Auckland, Sch Comp Sci, Auckland, New Zealand
关键词
Random forest; Recurring concepts; Concept drift; Data stream; CONCEPT DRIFTS;
D O I
10.1007/s41060-021-00273-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In order to adapt random forests to the dynamic nature of data streams, the state-of-the-art technique discards trained trees and grows new trees when concept drifts are detected. This is particularly wasteful when recurrent patterns exist. In this work, we introduce a novel framework called PEARL, which uses both an exact technique and a probabilistic graphical model with Lossy Counting, to replace drifted trees with relevant trees built in the past. The exact technique utilizes pattern matching to find the set of drifted trees that co-occurred in predictions in the past. Meanwhile, a probabilistic graphical model is being built to capture the tree replacements among recurrent concept drifts. Once the graphical model becomes stable, it replaces the exact technique and finds relevant trees in a probabilistic fashion. Further, Lossy Counting is applied to the graphical model which brings an added theoretical guarantee for both error rate and space complexity. We empirically show our technique outperforms baselines in terms of accuracy and kappa on both synthetic and real-world datasets.
引用
收藏
页码:17 / 32
页数:16
相关论文
共 50 条
  • [41] Adaptive Random Forest for Gait Prediction in Lower Limb Exoskeleton
    Guo, Xudong
    Zhong, Fengqi
    Xiao, Jianru
    Zhou, Zhenhua
    Xu, Wei
    JOURNAL OF BIOMIMETICS BIOMATERIALS AND BIOMEDICAL ENGINEERING, 2024, 64 : 55 - 67
  • [42] Adaptive Bagging Methods for Classification of Data Streams with Concept Drift
    Sarnovsky, Martin
    Marcinko, Jan
    ACTA POLYTECHNICA HUNGARICA, 2021, 18 (03) : 47 - 63
  • [43] Continuous monitoring for changepoints in data streams using adaptive estimation
    Bodenham, Dean A.
    Adams, Niall M.
    STATISTICS AND COMPUTING, 2017, 27 (05) : 1257 - 1270
  • [44] An adaptive algorithm for anomaly and novelty detection in evolving data streams
    Mohamed-Rafik Bouguelia
    Slawomir Nowaczyk
    Amir H. Payberah
    Data Mining and Knowledge Discovery, 2018, 32 : 1597 - 1633
  • [45] An adaptive ensemble classifier for mining concept drifting data streams
    Farid, Dewan Md.
    Zhang, Li
    Hossain, Alamgir
    Rahman, Chowdhury Mofizur
    Strachan, Rebecca
    Sexton, Graham
    Dahal, Keshav
    EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (15) : 5895 - 5906
  • [46] Adaptive Methods for Classification in Arbitrarily Imbalanced and Drifting Data Streams
    Lichtenwalter, Ryan N.
    Chawla, Nitesh V.
    NEW FRONTIERS IN APPLIED DATA MINING, 2010, 5669 : 53 - 75
  • [47] Concept drift robust adaptive novelty detection for data streams
    Cejnek, Matous
    Bukovsky, Ivo
    NEUROCOMPUTING, 2018, 309 : 46 - 53
  • [48] Probabilistic Forecasting of Generators Startups and Shutdowns in the MISO System Based on Random Forest
    Lin, Xinming
    Hou, Z. Jason
    Chen, Yonghong
    Rose, Steve
    Ma, Yaming
    Pan, Feng
    2020 IEEE POWER & ENERGY SOCIETY GENERAL MEETING (PESGM), 2020,
  • [49] Building Classification Using Random Forest to Develop a Geodatabase for Probabilistic Hazard Information
    Kim, Jooho
    Hatzis, Joshua J.
    Klockow, Kim
    Campbell, Patrick A.
    NATURAL HAZARDS REVIEW, 2022, 23 (03)
  • [50] Continuous monitoring for changepoints in data streams using adaptive estimation
    Dean A. Bodenham
    Niall M. Adams
    Statistics and Computing, 2017, 27 : 1257 - 1270