TDEP: efficiently processing top-k dominating query on massive data

被引:0
|
作者
Xixian Han
Jianzhong Li
Hong Gao
机构
[1] Harbin Institute of Technology,School of Computer Science and Technology
来源
Knowledge and Information Systems | 2015年 / 43卷
关键词
Massive data; Top-; dominating query; TDEP algorithm; Sorted lists; Early pruning;
D O I
暂无
中图分类号
学科分类号
摘要
In many applications including multicriteria decision making, top-k dominating query is a practically useful tool to return k tuples with the highest domination scores in a potentially huge data space. The existing algorithms, either requiring indexes built on the specific attribute subset, or incurring high I/O cost or memory cost, cannot process top-k dominating query on massive data efficiently. In this paper, a novel algorithm TDEP is proposed to utilize sorted lists built for each attribute with low cost to return top-k dominating result on massive data efficiently. Through analysis, it is found that TDEP can be divided into two phases: growing phase and shrinking phase. In each phase, TDEP retrieves the sorted lists in round-robin fashion and maintains the candidates until the stop condition is satisfied. The theoretical analysis is provided for the execution behavior in two phases. An efficient method is developed to compute the domination scores of tuples with the obtained candidates only. Besides, TDEP adopts early pruning to reduce the number of candidate tuples maintained significantly. The extensive experimental results, conducted on synthetic and real-life data sets, show the significant performance advantage of TDEP over the existing algorithms.
引用
收藏
页码:689 / 718
页数:29
相关论文
共 50 条
  • [21] Top-k query processing in uncertain Databases
    Soliman, Mohamed A.
    Ilyas, Ihab F.
    Chang, Kevin Chen-Chuan
    2007 IEEE 23RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2007, : 871 - +
  • [22] Top-k query processing over uncertain data in distributed environments
    Sun, Yongjiao
    Yuan, Ye
    Wang, Guoren
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2012, 15 (04): : 429 - 446
  • [23] TopX:: efficient and versatile top-k query processing for semistructured data
    Theobald, Martin
    Bast, Holger
    Majumdar, Debapriyo
    Schenkel, Ralf
    Weikum, Gerhard
    VLDB JOURNAL, 2008, 17 (01): : 81 - 115
  • [24] Efficient and Secure Top-k Query Processing on Hybrid Sensed Data
    Wu, Haiqin
    Wang, Liangmin
    MOBILE INFORMATION SYSTEMS, 2016, 2016
  • [25] TopX: efficient and versatile top-k query processing for semistructured data
    Martin Theobald
    Holger Bast
    Debapriyo Majumdar
    Ralf Schenkel
    Gerhard Weikum
    The VLDB Journal, 2008, 17 : 81 - 115
  • [26] An efficient massive data retrieval algorithm based on modified Top-k query
    Peng, Xiao
    2015 SEVENTH INTERNATIONAL CONFERENCE ON MEASURING TECHNOLOGY AND MECHATRONICS AUTOMATION (ICMTMA 2015), 2015, : 102 - 105
  • [27] Processing Spatial Keyword Query as a Top-k Aggregation Query
    Zhang, Dongxiang
    Chan, Chee-Yong
    Tan, Kian-Lee
    SIGIR'14: PROCEEDINGS OF THE 37TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2014, : 355 - 364
  • [28] On efficiently diversified top-k geo-social keyword query processing in road networks
    Zhao, Jingwen
    Gao, Yunjun
    Ma, Chunyu
    Jin, Pengfei
    Wen, Shiting
    INFORMATION SCIENCES, 2020, 512 : 813 - 829
  • [29] Uncertain top-k query processing in distributed environments
    Wang, Xite
    Shen, Derong
    Yu, Ge
    DISTRIBUTED AND PARALLEL DATABASES, 2016, 34 (04) : 567 - 589
  • [30] Joint Top-K Spatial Keyword Query Processing
    Wu, Dingming
    Yiu, Man Lung
    Cong, Gao
    Jensen, Christian S.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2012, 24 (10) : 1889 - 1903