Probabilistic Convex Hull Queries over Uncertain Data

被引:5
|
作者
Yan, Da [1 ]
Zhao, Zhou [1 ]
Ng, Wilfred [1 ]
Liu, Steven [2 ]
机构
[1] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Kowloon, Hong Kong, Peoples R China
[2] SUNY Stony Brook, Dept Comp Sci, Stony Brook, NY USA
关键词
Convex hull; uncertain data; Gibbs sampling;
D O I
10.1109/TKDE.2014.2340408
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The convex hull of a set of two-dimensional points, P, is the minimal convex polygon that contains all the points in P. Convex hull is important in many applications such as GIS, statistical analysis and data mining. Due to the ubiquity of data uncertainty such as location uncertainty in real-world applications, we study the concept of convex hull over uncertain data in 2D space. We propose the Probabilistic Convex Hull (PCH) query and demonstrate its applications, such as Flickr landscape photo extraction and activity region visualization, where location uncertainty is incurred by GPS devices or sensors. To tackle the problem of possible world explosion, we develop an O(N-3) algorithm based on geometric properties, where N is the data size. We further improve this algorithm with spatial indices and effective pruning techniques, which prune the majority of data instances. To achieve better time complexity, we propose another O(N-2 log N) algorithm, by maintaining a probability oracle in the form of a circular array with nice properties. Finally, to support applications that require fast response, we develop a Gibbs-sampling-based approximation algorithm which efficiently finds the PCH with high accuracy. Extensive experiments are conducted to verify the efficiency of our algorithms for answering PCH queries.
引用
收藏
页码:852 / 865
页数:14
相关论文
共 50 条
  • [11] Sliding-Window Probabilistic Threshold Aggregate Queries on Uncertain Data Streams
    Chen, Donghui
    Chen, Ling
    INFORMATION SCIENCES, 2020, 520 (520) : 353 - 372
  • [12] Probabilistic Threshold Join over Distributed Uncertain Data
    Deng, Lei
    Wang, Fei
    Huang, Benxiong
    WEB-AGE INFORMATION MANAGEMENT, 2011, 6897 : 68 - 80
  • [13] GDPS: An Efficient Approach for Skyline Queries over Distributed Uncertain Data
    Li, Xiaoyong
    Wang, Yijie
    Li, Xiaoling
    Wang, Xiaowei
    yu, Jie
    BIG DATA RESEARCH, 2014, 1 (01) : 23 - 36
  • [14] Efficient and Progressive Algorithms for Distributed Skyline Queries over Uncertain Data
    Ding, Xiaofeng
    Jin, Hai
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2012, 24 (08) : 1448 - 1462
  • [15] Ranking queries on uncertain data
    Hua, Ming
    Pei, Jian
    Lin, Xuemin
    VLDB JOURNAL, 2011, 20 (01): : 129 - 153
  • [16] Range queries on uncertain data
    Li, Jian
    Wang, Haitao
    THEORETICAL COMPUTER SCIENCE, 2016, 609 : 32 - 48
  • [17] Ranking queries on uncertain data
    Ming Hua
    Jian Pei
    Xuemin Lin
    The VLDB Journal, 2011, 20 : 129 - 153
  • [18] Predictive Nearest Neighbor Queries over Uncertain Spatial-Temporal Data
    Zhu, Jinghua
    Wang, Xue
    Li, Yingshu
    WIRELESS ALGORITHMS, SYSTEMS, AND APPLICATIONS, WASA 2014, 2014, 8491 : 424 - 435
  • [19] Parallel skyline queries over uncertain data streams in cloud computing environments
    Li, Xiaoyong
    Wang, Yijie
    Li, Xiaoling
    Wang, Yuan
    INTERNATIONAL JOURNAL OF WEB AND GRID SERVICES, 2014, 10 (01) : 24 - 53
  • [20] Parallel n-of-N Skyline Queries over Uncertain Data Streams
    Liu, Jun
    Li, Xiaoyong
    Ren, Kaijun
    Song, Junqiang
    Zhang, Zongshuo
    DATABASE AND EXPERT SYSTEMS APPLICATIONS (DEXA 2018), PT II, 2018, 11030 : 176 - 184