Distributed Top-k Query Processing on Multi-dimensional Data with Keywords

被引:4
|
作者
Amagata, Daichi [1 ]
Hara, Takahiro [1 ]
Nishio, Shojiro [1 ]
机构
[1] Osaka Univ, Grad Sch Informat Sci & Technol, Dept Multimedia Engn, Suita, Osaka 565, Japan
来源
PROCEEDINGS OF THE 27TH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT | 2015年
关键词
RETRIEVAL;
D O I
10.1145/2791347.2791355
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As we are in the big data era, techniques for retrieving only user desirable data objects from massive and diverse datasets is being required. Ranking queries, e.g., top-k queries, which rank data objects based on a user-specified scoring function, enable to find such interesting data for users, and have received significant attention due to its wide range of applications. While many techniques for both centralized and distributed top-k query processing have been developed, they do not consider query keywords, i.e., simply retrieving k data with the best score. Utilizing keywords, on the other hand, is a common approach in data (and information) retrieval. Despite of this fact, there is no study on retrieving top-k data containing all query keywords. We define, in this paper, a new query which enriches the conventional top-k queries, and propose some algorithms to solve the novel problem of how to efficiently retrieve k data objects with the best score and all query keywords from distributed databases. Extensive experiments on both real and synthetic data have demonstrated the efficiency and scalability of our algorithms in terms of communication cost and running time.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Distributed multi-dimensional probabilistic Top-k query processing in sensor networks
    Zhu, Jinghua
    Guan, Xuemin
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2012, 40 (SUPPL.1): : 389 - 393
  • [2] Continuous Multi-dimensional Top-k Query Processing in Sensor Networks
    Jiang, Hongbo
    Cheng, Jie
    Wang, Dan
    Wang, Chonggang
    Tan, Guang
    2011 PROCEEDINGS IEEE INFOCOM, 2011, : 793 - 801
  • [3] Improved Multi-Dimensional Top-K Query Processing Based on Data Prediction in Wireless Sensor Networks
    Zhang, Zhen-Jiang
    Jie, Jun-Ren
    Hao, Zi-Qi
    Liu, Yun
    JOURNAL OF INTERNET TECHNOLOGY, 2014, 15 (03): : 343 - 350
  • [4] Authentication of Multi-dimensional Top-K Query on Untrusted Server
    Zhu, Xiaoyu
    Wu, Jie
    Chang, Wei
    Wang, Guojun
    Liu, Qin
    2018 IEEE/ACM 26TH INTERNATIONAL SYMPOSIUM ON QUALITY OF SERVICE (IWQOS), 2018,
  • [5] Top-k query processing over uncertain data in distributed environments
    Sun, Yongjiao
    Yuan, Ye
    Wang, Guoren
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2012, 15 (04): : 429 - 446
  • [6] Uncertain top-k query processing in distributed environments
    Wang, Xite
    Shen, Derong
    Yu, Ge
    DISTRIBUTED AND PARALLEL DATABASES, 2016, 34 (04) : 567 - 589
  • [7] Efficient Distributed Top-k Query Processing with Caching
    Ryeng, Norvald H.
    Vlachou, Akrivi
    Doulkeridis, Christos
    Norvag, Kjetil
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PT II, 2011, 6588 : 280 - 295
  • [8] Uncertain top-k query processing in distributed environments
    Xite Wang
    Derong Shen
    Ge Yu
    Distributed and Parallel Databases, 2016, 34 : 567 - 589
  • [9] Extracting Top-K Insights from Multi-dimensional Data
    Tang, Bo
    Han, Shi
    Yiu, Man Lung
    Ding, Rui
    Zhang, Dongmei
    SIGMOD'17: PROCEEDINGS OF THE 2017 ACM INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2017, : 1509 - 1524
  • [10] Multi-dimensional top-k dominating queries
    Man Lung Yiu
    Nikos Mamoulis
    The VLDB Journal, 2009, 18 : 695 - 718