PCA Feature Extraction for Change Detection in Multidimensional Unlabeled Data

被引:160
作者
Kuncheva, Ludmila I. [1 ]
Faithfull, William J. [1 ]
机构
[1] Bangor Univ, Sch Comp Sci, Bangor LL57 1UT, Gwynedd, Wales
关键词
Change detection; feature extraction; log-likelihood detector; pattern recognition; CONCEPT DRIFT; CHARTS;
D O I
10.1109/TNNLS.2013.2248094
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
When classifiers are deployed in real-world applications, it is assumed that the distribution of the incoming data matches the distribution of the data used to train the classifier. This assumption is often incorrect, which necessitates some form of change detection or adaptive classification. While there has been a lot of work on change detection based on the classification error monitored over the course of the operation of the classifier, finding changes in multidimensional unlabeled data is still a challenge. Here, we propose to apply principal component analysis (PCA) for feature extraction prior to the change detection. Supported by a theoretical example, we argue that the components with the lowest variance should be retained as the extracted features because they are more likely to be affected by a change. We chose a recently proposed semiparametric log-likelihood change detection criterion that is sensitive to changes in both mean and variance of the multidimensional distribution. An experiment with 35 datasets and an illustration with a simple video segmentation demonstrate the advantage of using extracted features compared to raw data. Further analysis shows that feature extraction through PCA is beneficial, specifically for data with multiple balanced classes.
引用
收藏
页码:69 / 80
页数:12
相关论文
共 30 条
[1]  
Alippi C, 2011, 2011 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), P2889, DOI 10.1109/IJCNN.2011.6033600
[2]  
[Anonymous], 2011, J MACHINE LEARNING R
[3]  
[Anonymous], 2007, Uci machine learning repository
[4]  
Baena-Garcia M, 2006, 4 INT WORKSH KNOWL D, V6, P77
[5]  
Basseville M, 1993, DETECTION ABRUPT CHA
[6]  
Bifet A, 2007, PROCEEDINGS OF THE SEVENTH SIAM INTERNATIONAL CONFERENCE ON DATA MINING, P443
[7]   Feature Extraction for Change-Point Detection Using Stationary Subspace Analysis [J].
Blythe, Duncan A. J. ;
von Buenau, Paul ;
Meinecke, Frank C. ;
Mueller, Klaus-Robert .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2012, 23 (04) :631-643
[8]  
Dasu T., 2006, P S INT STAT COMP S, P1
[9]  
Everitt B.S., 2001, A Handbook of Statistical Analysis Using S-PLUS, V2nd
[10]  
Gabele M., 2006, P 3 INT WORKSH KNOWL, P1