Online neural network model for non-stationary and imbalanced data stream classification

被引:62
作者
Ghazikhani, Adel [1 ]
Monsefi, Reza [1 ]
Yazdi, Hadi Sadoghi [1 ]
机构
[1] Ferdowsi Univ Mashhad, Dept Comp Engn, Mashhad, Iran
关键词
Data stream classification; Online learning; Neural Networks; Concept drift; Imbalanced data; FEATURE-SELECTION; ENVIRONMENTS; PREDICTION; ALGORITHM;
D O I
10.1007/s13042-013-0180-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
"Concept drift'' and class imbalance are two challenges for supervised classifiers. "Concept drift'' (or non-stationarity) is changes in the underlying function being learnt, and class imbalance is a vast difference between the numbers of instances in different classes of data. Class imbalance is an obstacle for the efficiency of most classifiers. Previous methods for classifying non-stationary and imbalanced data streams mainly focus on batch solutions, in which the classification model is trained using a chunk of data. Here, we propose an online Neural Network (NN) model. The NN model, is composed of two different parts for handling concept drift and class imbalance. Concept drift is handled with a forgetting function and class imbalance is handled with a specific error function which assigns different importance to error in separate classes. The proposed method is evaluated on 3 synthetic and 8 real world datasets. The results show statistically significant improvement to previous online NN methods.
引用
收藏
页码:51 / 62
页数:12
相关论文
共 43 条
[1]   Classification Using Streaming Random Forests [J].
Abdulsalam, Hanady ;
Skillicorn, David B. ;
Martin, Patrick .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2011, 23 (01) :22-36
[2]  
Alpaydin E., 2010, Introduction to Machine Learning, V2
[3]  
[Anonymous], 1986, FOUNDATIONS, DOI DOI 10.7551/MITPRESS/5236.001.0001
[4]   Parameter selection algorithm with self adaptive growing neural network classifier for diagnosis issues [J].
Barakat, M. ;
Lefebvre, D. ;
Khalil, M. ;
Druaux, F. ;
Mustapha, O. .
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2013, 4 (03) :217-233
[5]   FSVM-CIL: Fuzzy Support Vector Machines for Class Imbalance Learning [J].
Batuwita, Rukshan ;
Palade, Vasile .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2010, 18 (03) :558-571
[6]   Classifying cognitive states of brain activity via one-class neural networks with feature selection by genetic algorithms [J].
Boehm, Omer ;
Hardoon, David R. ;
Manevitz, Larry M. .
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2011, 2 (03) :125-134
[7]   Incremental learning with multi-level adaptation [J].
Bouchachia, Abdelhamid .
NEUROCOMPUTING, 2011, 74 (11) :1785-1799
[8]   Towards incremental learning of nonstationary imbalanced data stream: a multiple selectively recursive approach [J].
Chen, Sheng ;
He, Haibo .
EVOLVING SYSTEMS, 2011, 2 (01) :35-50
[9]  
Ditzler G, 2010, WCCI
[10]   Incremental Learning of Concept Drift in Nonstationary Environments [J].
Elwell, Ryan ;
Polikar, Robi .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2011, 22 (10) :1517-1531