Statistical Change Detection for Multi-Dimensional Data

被引:0
作者
Song, Xiuyao [1 ]
Wu, Mingxi [1 ]
Jermaine, Christopher [1 ]
Ranka, Sanjay [1 ]
机构
[1] Univ Florida, Dept Comp & Informat Sci & Engn, Gainesville, FL 32611 USA
来源
KDD-2007 PROCEEDINGS OF THE THIRTEENTH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING | 2007年
关键词
Change detection; density test; kernel density estimation;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper deals with detecting change of distribution in multi-dimensional data sets. For a given baseline data set and a set of newly observed data points, we define a statistical test called the density test for deciding if the observed data points are sampled from the underlying distribution that produced the baseline data set. We define a test statistic that is strictly distribution-free under the null hypothesis. Our experimental results show that the density test has substantially more power than the two existing methods for multi-dimensional change detection.
引用
收藏
页码:667 / 676
页数:10
相关论文
共 21 条
[1]  
AGARWAL D, 2006, SODA
[2]  
AGARWAL D, 2006, SIGKDD
[3]  
[Anonymous], P ACM SIGMOD INT C M
[4]  
[Anonymous], 1997, A gentle tutorial of the EM algorithm and its application to parameter estimation for Gaussian mixture and hidden Markov models
[5]   Detecting group differences: Mining contrast sets [J].
Bay, SD ;
Pazzani, MJ .
DATA MINING AND KNOWLEDGE DISCOVERY, 2001, 5 (03) :213-246
[6]   LOF: Identifying density-based local outliers [J].
Breunig, MM ;
Kriegel, HP ;
Ng, RT ;
Sander, J .
SIGMOD RECORD, 2000, 29 (02) :93-104
[7]   An algorithm for data-driven bandwidth selection [J].
Comaniciu, D .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2003, 25 (02) :281-288
[8]  
DASU T, 2006, INTERFACE
[9]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[10]   Distance-based outliers: algorithms and applications [J].
Knorr, EM ;
Ng, RT ;
Tucakov, V .
VLDB JOURNAL, 2000, 8 (3-4) :237-253