PCA Feature Extraction for Change Detection in Multidimensional Unlabeled Data

被引:160
作者
Kuncheva, Ludmila I. [1 ]
Faithfull, William J. [1 ]
机构
[1] Bangor Univ, Sch Comp Sci, Bangor LL57 1UT, Gwynedd, Wales
关键词
Change detection; feature extraction; log-likelihood detector; pattern recognition; CONCEPT DRIFT; CHARTS;
D O I
10.1109/TNNLS.2013.2248094
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
When classifiers are deployed in real-world applications, it is assumed that the distribution of the incoming data matches the distribution of the data used to train the classifier. This assumption is often incorrect, which necessitates some form of change detection or adaptive classification. While there has been a lot of work on change detection based on the classification error monitored over the course of the operation of the classifier, finding changes in multidimensional unlabeled data is still a challenge. Here, we propose to apply principal component analysis (PCA) for feature extraction prior to the change detection. Supported by a theoretical example, we argue that the components with the lowest variance should be retained as the extracted features because they are more likely to be affected by a change. We chose a recently proposed semiparametric log-likelihood change detection criterion that is sensitive to changes in both mean and variance of the multidimensional distribution. An experiment with 35 datasets and an illustration with a simple video segmentation demonstrate the advantage of using extracted features compared to raw data. Further analysis shows that feature extraction through PCA is beneficial, specifically for data with multiple balanced classes.
引用
收藏
页码:69 / 80
页数:12
相关论文
共 30 条
[11]  
Gama J, 2004, LECT NOTES ARTIF INT, V3171, P286
[12]  
Ho S.-S., 2005, P 22 INT C MACH LEAR, P321, DOI DOI 10.1145/1102351.1102392
[13]   The generalization of Student's ratio [J].
Hotelling, H .
ANNALS OF MATHEMATICAL STATISTICS, 1931, 2 :360-378
[14]  
Kifer D., 2004, VLDB, V4, P180
[15]  
Klinkenberg R., 1998, LEARNING TEXT CATEGO, P33
[16]  
Koychev I, 2005, P 25 SGAI INT C INN, P46
[17]   Change Detection in Streaming Multivariate Data Using Likelihood Detectors [J].
Kuncheva, Ludmila I. .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2013, 25 (05) :1175-1180
[18]  
Lazarescu M. M., 2003, Proceedings of the IASTED International Conference on Intelligent Systems and Control, P14
[19]  
Lee DD, 2001, ADV NEUR IN, V13, P556
[20]  
Lung-Yut-Fong A, 2011, INT CONF ACOUST SPEE, P3608