Probabilistic Convex Hull Queries over Uncertain Data

被引:5
|
作者
Yan, Da [1 ]
Zhao, Zhou [1 ]
Ng, Wilfred [1 ]
Liu, Steven [2 ]
机构
[1] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Kowloon, Hong Kong, Peoples R China
[2] SUNY Stony Brook, Dept Comp Sci, Stony Brook, NY USA
关键词
Convex hull; uncertain data; Gibbs sampling;
D O I
10.1109/TKDE.2014.2340408
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The convex hull of a set of two-dimensional points, P, is the minimal convex polygon that contains all the points in P. Convex hull is important in many applications such as GIS, statistical analysis and data mining. Due to the ubiquity of data uncertainty such as location uncertainty in real-world applications, we study the concept of convex hull over uncertain data in 2D space. We propose the Probabilistic Convex Hull (PCH) query and demonstrate its applications, such as Flickr landscape photo extraction and activity region visualization, where location uncertainty is incurred by GPS devices or sensors. To tackle the problem of possible world explosion, we develop an O(N-3) algorithm based on geometric properties, where N is the data size. We further improve this algorithm with spatial indices and effective pruning techniques, which prune the majority of data instances. To achieve better time complexity, we propose another O(N-2 log N) algorithm, by maintaining a probability oracle in the form of a circular array with nice properties. Finally, to support applications that require fast response, we develop a Gibbs-sampling-based approximation algorithm which efficiently finds the PCH with high accuracy. Extensive experiments are conducted to verify the efficiency of our algorithms for answering PCH queries.
引用
收藏
页码:852 / 865
页数:14
相关论文
共 50 条
  • [21] THE CONVEX HULL OF A QUADRATIC CONSTRAINT OVER A POLYTOPE
    Santana, Asteroide
    Dey, Santanu S.
    SIAM JOURNAL ON OPTIMIZATION, 2020, 30 (04) : 2983 - 2997
  • [22] Continuous Monitoring of Top-k Dominating Queries over Uncertain Data Streams
    Li, Guohui
    Luo, Changyin
    Li, Jianjun
    WEB INFORMATION SYSTEMS ENGINEERING - WISE 2014, PT I, 2014, 8786 : 244 - 255
  • [23] Supporting ranking queries on uncertain and incomplete data
    Soliman, Mohamed A.
    Ilyas, Ihab F.
    Ben-David, Shalev
    VLDB JOURNAL, 2010, 19 (04): : 477 - 501
  • [24] Indexing Metric Uncertain Data for Range Queries
    Chen, Lu
    Gao, Yunjun
    Li, Xinhan
    Jensen, Christian S.
    Chen, Gang
    Zheng, Baihua
    SIGMOD'15: PROCEEDINGS OF THE 2015 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2015, : 951 - 965
  • [25] Approximation algorithms for aggregate queries on uncertain data
    Chen D.
    Chen L.
    Wang J.
    Wu Y.
    Wang J.
    Qinghua Daxue Xuebao/Journal of Tsinghua University, 2018, 58 (03): : 231 - 236
  • [26] Supporting ranking queries on uncertain and incomplete data
    Mohamed A. Soliman
    Ihab F. Ilyas
    Shalev Ben-David
    The VLDB Journal, 2010, 19 : 477 - 501
  • [27] Aggregation Queries of Uncertain Data in Relational Databases
    Xie, Dong
    Xiao, Jie
    2011 INTERNATIONAL CONFERENCE ON FUTURE COMPUTER SCIENCE AND APPLICATION (FCSA 2011), VOL 3, 2011, : 69 - 71
  • [28] Query Rewriting on Aggregate Queries over Uncertain Database
    Xie, Dong
    Long, Hai
    COMPUTING AND INTELLIGENT SYSTEMS, PT IV, 2011, 234 : 25 - 31
  • [29] Query Rewriting on Aggregate Queries over Uncertain Database
    Xie, Dong
    Long, Hai
    2010 SECOND INTERNATIONAL CONFERENCE ON E-LEARNING, E-BUSINESS, ENTERPRISE INFORMATION SYSTEMS, AND E-GOVERNMENT (EEEE 2010), VOL I, 2010, : 368 - 371
  • [30] Parallelizing skyline queries over uncertain data streams with sliding window partitioning and grid index
    Li, Xiaoyong
    Wang, Yijie
    Li, Xiaoling
    Wang, Yuan
    KNOWLEDGE AND INFORMATION SYSTEMS, 2014, 41 (02) : 277 - 309