Outlier detection from large distributed databases

被引:0
作者
Ji Zhang
Xiaohui Tao
Hua Wang
机构
[1] University of Southern Queensland,Department of Mathematics and Computing
来源
World Wide Web | 2014年 / 17卷
关键词
Data mining; Distributed database; Outlier detection;
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, we present an innovative system, coined as DISTROD (a.k.a DISTRibuted Outlier Detector), for detecting outliers, namely abnormal instances or observations, from multiple large distributed databases. DISTROD is able to effectively detect the so-called global outliers from distributed databases that are consistent with those produced by the centralized detection paradigm. DISTROD is equipped with a number of optimization/boosting strategies which empower it to significantly enhance its speed performance and reduce its communication overhead. Experimental evaluation demonstrates the good performance of DISTROD in terms of speed and communication overhead.
引用
收藏
页码:539 / 568
页数:29
相关论文
共 10 条
[1]  
Koufakou A(2010)A fast outlier detection strategy for distributed high-dimensional data sets with mixed attributes Data Min. Knowl. Disc. 20 259-289
[2]  
Georgiopoulos M(2006)Fast distributed outlier detection in mixed attribute data sets Data Min. Knowl. Disc. 12 203-228
[3]  
Otey M(2006)Detecting outlying subspaces for high-dimensional data: the new task, algorithms and performance Knowl. Inf. Syst. (KAIS) 10 333-355
[4]  
Ghoting A(2005)Clustering in dynamic spatial databases J. Intell. Inf. Syst. (JIIS) 24 5-27
[5]  
Parthasarathy S(undefined)undefined undefined undefined undefined-undefined
[6]  
Zhang J(undefined)undefined undefined undefined undefined-undefined
[7]  
Wang H(undefined)undefined undefined undefined undefined-undefined
[8]  
Zhang J(undefined)undefined undefined undefined undefined-undefined
[9]  
Hsu W(undefined)undefined undefined undefined undefined-undefined
[10]  
Lee ML(undefined)undefined undefined undefined undefined-undefined