Outliers and change-points detection algorithm for time series

被引:0
作者
Su, Weixing [1 ,2 ]
Zhu, Yunlong [1 ]
Liu, Fang [3 ]
Hu, Kunyuan [1 ]
机构
[1] Shenyang Institute of Automation, Chinese Academy of Sciences
[2] University of Chinese Academy of Sciences
[3] Brilliance Automobile Engineering Research Institute
来源
Jisuanji Yanjiu yu Fazhan/Computer Research and Development | 2014年 / 51卷 / 04期
关键词
Change-point; Lipschitz exponent; Outlier; Time series; Wavelet transform;
D O I
10.7544/issn1000-1239.2014.20120542
中图分类号
学科分类号
摘要
Because the conventional change-points detection method exists the shortages on time delay and inapplicability for the time series mingled with outliers in the practical applications, an outlier and change-point detection algorithm for time series, which is based on the wavelet transform of the efficient score vector, is proposed in this paper. The algorithm introduces the efficient score vector to solve the problem of the conventional detection method that statistics often increass infinitely with the number of data added during the process of detection, and proposes a strategy of analyzing the statistics by using wavelet in order to avoid the serious time delay. In order to distinguish the outlier and change-point during the detection process, we propose a detecting framework based on the relationship between Lipschitz exponent and the wavelet coefficients, by which both outlier and change-point can be detected out meanwhile. The advantage of this method is that the detection effect is not subject to the influence of the outlier. It means that the method can deal with the time series containing both outliers and change-points under actual operating conditions and it is more suitable for the real application. Eventually, the effectiveness and practicality of the proposed detection method have been proved through simulation results.
引用
收藏
页码:781 / 788
页数:7
相关论文
共 21 条
[1]  
Shao J., Rong G., Lee J., Learning a data-dependent kernel function for KPCA-based nonlinear process monitoring, Chemical Engineering Research & Design, 87, A, pp. 1471-1480, (2009)
[2]  
Zou B., Liu Q., ARMA-based traffic prediction and overload detection of network, Journal of Computer Research and Development, 39, 12, pp. 1645-1652, (2002)
[3]  
Zou X., Deng Z., Ge M., Et al., GPS data processing of networks with mixed single-and dual-frequency receivers for deformation monitoring, Advances in Space Research, 46, 2, pp. 130-135, (2010)
[4]  
Barnet V., Lewis T., Outlier in Statistical Data, (1994)
[5]  
Knorr E.M., Ng R.T., Finding intentional knowledge of distance-based outliers, Proc of the 25th Int Conf on Very Large Data Bases, pp. 211-222, (1999)
[6]  
Ramaswamy S., Rastogi R., Shim K., Efficient algorithms for mining outliers from large data sets, Proc of the ACM SIGMOD Int Conf on Management of Data, pp. 427-438, (2000)
[7]  
Markou M., Singh S., Novelty detection: A review - Part 2: Neural network based approaches, Signal Processing, 83, 12, pp. 2499-2521, (2003)
[8]  
Mourao-Miranda J., Hardoon D.R., Hahn T., Et al., Patient classification as an outlier detection problem: An application of the one-class support vector machine, Neuroimage, 58, 3, pp. 793-804, (2011)
[9]  
Wang J.S., Chiang J.C., A cluster validity measure with outlier detection for support vector clustering, IEEE Trans on Systems Man and Cybernetics, Part B-Cybernetics, 38, 1, pp. 78-89, (2008)
[10]  
Percival D.B., Walden A.T., Wavelet Methods for Time Series Analysis, (2006)