IMPROVED BISECTOR CLUSTERING OF UNCERTAIN DATA USING SDSA METHOD ON PARALLEL PROCESSORS

被引:0
作者
Lukic, Ivica [1 ]
Slavek, Ninoslav [1 ]
Koehler, Mirko [1 ]
机构
[1] JJ Strossmayer Univ Osijek, Fac Elect Engn, Osijek 31000, Croatia
来源
TEHNICKI VJESNIK-TECHNICAL GAZETTE | 2013年 / 20卷 / 02期
关键词
clustering; data mining; expected distance; parallel processing; uncertain data;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Clustering uncertain objects is a well researched field. This paper is concerned with clustering uncertain objects with 2D location uncertainty due to object movements. Location of moving object is reported periodically, thus location is uncertain and described with probability density function (PDF). Data about moving objects and their locations are placed in distributed databases. Number of uncertain objects can be very large and obtaining quality result within reasonable time is a challenging task. Basic clustering method is UK-means, in which all expected distances (ED) from objects to clusters are calculated. Thus UK-means is inefficient. To avoid ED calculations various pruning methods are proposed. A survey of existing clustering methods is given in this paper and a combination of two methods is proposed. The first method, called Segmentation of Data Set Area is combined with Improved Bisector pruning to improve execution time of clustering uncertain data. In SDSA method, data set area is divided in many small segments, and only objects in that small segment are observed. Using segments there is a possibility for parallel computing, because segments are mutually independent, thus each segment can be computed on different core of parallel processor. Experiments were conducted to evaluate the effectiveness of the combined methods.
引用
收藏
页码:255 / 261
页数:7
相关论文
共 19 条
  • [1] [Anonymous], 2004, Proceedings of the Thirtieth international conference on Very large data bases-Volume
  • [2] Chau M, 2006, LECT NOTES ARTIF INT, V3918, P199
  • [3] Querying imprecise data in moving object environments
    Cheng, R
    Kalashnikov, DV
    Prabhakar, S
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2004, 16 (09) : 1112 - 1127
  • [4] Dalvi Nilesh, 2004, VLDB, V30, P864
  • [5] VORONOI TREES AND CLUSTERING PROBLEMS
    DEHNE, F
    NOLTEMEIER, H
    [J]. INFORMATION SYSTEMS, 1987, 12 (02) : 171 - 175
  • [6] Ester M., 1996, DENSITY BASED ALGORI, DOI DOI 10.5555/3001460.3001507
  • [7] GENERALIZED MINKOWSKI METRICS FOR MIXED FEATURE-TYPE DATA-ANALYSIS
    ICHINO, M
    YAGUCHI, H
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1994, 24 (04): : 698 - 708
  • [8] Clustering Uncertain Data Using Voronoi Diagrams and R-Tree Index
    Kao, Ben
    Lee, Sau Dan
    Lee, Foris K. F.
    Cheung, David Wai-lok
    Ho, Wai-Shing
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2010, 22 (09) : 1219 - 1233
  • [9] Clustering Uncertain Data using Voronoi Diagrams
    Kao, Ben
    Lee, Sau Dan
    Cheung, David W.
    Ho, Wai-Shing
    Chan, K. F.
    [J]. ICDM 2008: EIGHTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2008, : 333 - 342
  • [10] Korn F, 2000, SIGMOD REC, V29, P201, DOI 10.1145/335191.335415