Detecting spatio-temporal outliers in climate dataset: A method study

被引:0
作者
Sun, YX [1 ]
Xie, KQ [1 ]
Ma, XJ [1 ]
Jin, XX [1 ]
Wen, P [1 ]
Gao, XP [1 ]
机构
[1] Peking Univ, Dept Intelligent Sci, Beijing 100871, Peoples R China
来源
IGARSS 2005: IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, VOLS 1-8, PROCEEDINGS | 2005年
关键词
outlier detecting; spatio-temporal outlier;
D O I
暂无
中图分类号
P [天文学、地球科学];
学科分类号
07 ;
摘要
Outlier detecting is one of the most important data analysis technologies in data mining, which can he used to discover anomalous phenomena in huge dataset. Many literatures on spatial outlier detecting and time series outlier detecting have appeared, while the area of spatio-temporal outliers considering both spatial and temporal dimensions has still rarely been touched. Defining, outliers in traditional dataset is more explicit because the data structure we need to focus on is very straightforward (e.g., a spatial point or a transaction record). However, it is much more difficult to give outlier a definite characterization in spatio-temporal lattice data, since there are so many data structures we can pay attention to. With the aim of detecting useful and meaningful outliers in climate dataset, We introduce it formalized way to define outliers in spatio-temporal lattice data, in which the importance of clarifying basic data structure (we call it basic element in our paper) is stressed. As a case study, we define two kinds of spatio-temporal outliers based on a global climate dataset, according to the three aspects we propose in defining an outlier. The introduction of basic element and the formulation of outlier definition process make it easier and clearer to define meaningful outliers. Thus outlier detecting in spatio-temporal lattice data will provide us with really interesting and useful knowledge.
引用
收藏
页码:760 / 763
页数:4
相关论文
共 12 条
  • [1] Adam N, 2004, P 2004 ACM S APPL CO, P576, DOI DOI 10.1145/967900.968020
  • [2] AGGARWAL CC, 2001, ACM SIGMOD C
  • [3] Breunig M.M., 2000, ACM SIGMOD INT C MAN
  • [4] CHENG T, 2004, P 12 INT C GEOINF, P173
  • [5] Han J., 2012, Data Mining, P393, DOI [DOI 10.1016/B978-0-12-381479-1.00009-5, 10.1016/B978-0-12-381479-1.00001-0]
  • [6] Knorr E. M., 1998, Proceedings of the Twenty-Fourth International Conference on Very-Large Databases, P392
  • [7] Distance-based outliers: algorithms and applications
    Knorr, EM
    Ng, RT
    Tucakov, V
    [J]. VLDB JOURNAL, 2000, 8 (3-4) : 237 - 253
  • [8] Kou, 2003, P 11 ACM INT S ADV G, P49
  • [9] Lu C-T., 2004, P 12 ANN ACM INT WOR, P258
  • [10] MITCHELL TD, 2003, CRU TS 2 0 INTRO TYN