GPU-Accelerated Extreme Learning Machines for Imbalanced Data Streams with Concept Drift

被引:15
作者
Krawczyk, Bartosz [1 ]
机构
[1] Wroclaw Univ Technol, Dept Syst & Comp Networks, Wyb Wyspianskiego 27, PL-50370 Wroclaw, Poland
来源
INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE 2016 (ICCS 2016) | 2016年 / 80卷
关键词
Data streams; Imbalanced data; Concept drift; Big data; Extreme learning machines; GPU; ALGORITHM;
D O I
10.1016/j.procs.2016.05.509
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Mining data streams is one of the most vital fields in the current era of big data. Continuously arriving data may pose various problems, connected to their volume, variety or velocity. In this paper we focus on two important difficulties embedded in the nature of data streams: non-stationary nature and skewed class distributions. Such a scenario requires a classifier that is able to rapidly adapt itself to concept drift and displays robustness to class imbalance problem. We propose to use online version of Extreme Learning Machine that is enhanced by an efficient drift detector and method to alleviate the bias towards the majority class. We investigate three approaches based on undersampling, oversampling and cost-sensitive adaptation. Additionally, to allow for a rapid updating of the proposed classifier we show how to implement online Extreme Learning Machines with the usage of GPU. The proposed approach allows for a highly efficient mining of high-speed, drifting and imbalanced data streams with significant acceleration offered by GPU processing.
引用
收藏
页码:1692 / 1701
页数:10
相关论文
共 21 条
  • [1] A Straightforward Implementation of a GPU-accelerated ELM in R with NVIDIA Graphic Cards
    Alia-Martinez, M.
    Antonanzas, J.
    Antonanzas-Torres, F.
    Pernia-Espinoza, A.
    Urraca, R.
    [J]. HYBRID ARTIFICIAL INTELLIGENT SYSTEMS (HAIS 2015), 2015, 9121 : 656 - 667
  • [2] [Anonymous], 2014, P INT WORKSH NEW FRO
  • [3] Efficient Online Evaluation of Big Data Stream Classifiers
    Bifet, Albert
    Morales, Gianmarco De Francisci
    Read, Jesse
    Holmes, Geoff
    Pfahringer, Bernhard
    [J]. KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2015, : 59 - 68
  • [4] Bifet A, 2007, PROCEEDINGS OF THE SEVENTH SIAM INTERNATIONAL CONFERENCE ON DATA MINING, P443
  • [5] Optimized Parameter Search for Large Datasets of the Regularization Parameter and Feature Selection for Ridge Regression
    Buteneers, Pieter
    Caluwaerts, Ken
    Dambre, Joni
    Verstraeten, David
    Schrauwen, Benjamin
    [J]. NEURAL PROCESSING LETTERS, 2013, 38 (03) : 403 - 416
  • [6] Chong E.K., 2013, An introduction to optimization, V76
  • [7] Hybrid computer vision system for drivers' eye recognition and fatigue monitoring
    Cyganek, Boguslaw
    Gruszczynski, Slawomir
    [J]. NEUROCOMPUTING, 2014, 126 : 78 - 94
  • [8] Extreme learning machine: algorithm, theory and applications
    Ding, Shifei
    Zhao, Han
    Zhang, Yanan
    Xu, Xinzheng
    Nie, Ru
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2015, 44 (01) : 103 - 115
  • [9] A survey on learning from data streams: current and future trends
    Gama, Joao
    [J]. PROGRESS IN ARTIFICIAL INTELLIGENCE, 2012, 1 (01) : 45 - 55
  • [10] Huang GB, 2005, PROCEEDINGS OF THE IASTED INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE, P232