Approximate Continuous Top-K Queries over Memory Limitation-Based Streaming Data

被引:9
|
作者
Zhu, Rui [1 ]
Meng, Liu [1 ]
Wang, Bin [2 ]
Yang, Xiaochun [2 ]
Xia, Xiufeng [1 ]
机构
[1] Shenyang Aerosp Univ, Sch Comp Sci, Shenyang, Peoples R China
[2] Northeastern Univ, Coll Comp Sci & Engn, Shenyang, Peoples R China
关键词
Data stream; Continuous top-k query; Approximate; Memory limitation;
D O I
10.1007/978-3-031-00123-9_1
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Continuous top-k query over sliding window is a fundamental problem over data stream. It retrieves k objects with the highest scores when the window slides. Existing efforts include exact-based algorithms and approximate-based algorithms. Their common idea is maintaining a small subset of objects in the window. When the window slides, query results could be found from this set as much as possible. However, the space cost of all existing efforts is high, i.e., linear to the scale of objects in the window, cannot work under memory limitation-based streaming data, i.e., a general environment in real applications. In this paper, we define a novel query named rho-approximate continuous top-k query, which returns error-bounded answers to the system. Here, rho is a threshold, used for bounding the score ratio between approximate and exact results. In order to support rho-approximate continuous top-k query, we propose a novel framework named rho-TOPK. It can selfadaptively adjust rho based on the distribution of streaming data, and achieve the goal of supporting rho-approximate continuous top-k query over memory limitation-based streaming data. Theoretical analysis indicates that even in the worsst case, both running cost and space cost of rho-TOPK are all unrelated with data scale.
引用
收藏
页码:3 / 20
页数:18
相关论文
共 50 条
  • [1] Approximate Continuous Skyline Queries over Memory Limitation-Based Streaming Data
    An, Yunzhe
    Zhen, Zhu
    Zhu, Rui
    Qiu, Tao
    Xia, Xiufeng
    WEB AND BIG DATA, PT III, APWEB-WAIM 2023, 2024, 14333 : 96 - 110
  • [2] SAP: Improving Continuous Top-K Queries Over Streaming Data
    Zhu, Rui
    Wang, Bin
    Yang, Xiaochun
    Zheng, Baihua
    Wang, Guoren
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2017, 29 (06) : 1310 - 1328
  • [3] EPF: A General Framework for Supporting Continuous Top-k Queries Over Streaming Data
    Hong Jiang
    Rui Zhu
    Bin Wang
    Cognitive Computation, 2020, 12 : 176 - 194
  • [4] EPF: A General Framework for Supporting Continuous Top-k Queries Over Streaming Data
    Jiang, Hong
    Zhu, Rui
    Wang, Bin
    COGNITIVE COMPUTATION, 2020, 12 (01) : 176 - 194
  • [5] SAP: Improving Continuous Top-K Queries over Streaming Data (Extended Abstract)
    Zhu, Rui
    Wang, Bin
    Yang, Xiaochun
    Zheng, Baihua
    Wang, Guoren
    2018 IEEE 34TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2018, : 1819 - 1820
  • [6] Scaling the monitoring of approximate top-k queries in streaming windows
    Ramperez, Victor
    Zahmatkesh, Shima
    Della Valle, Emanuele
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 181 - 189
  • [7] Top-K Frequent Term Queries on Streaming Data
    Farazi, Sara
    Rafiei, Davood
    2019 IEEE 35TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2019), 2019, : 1582 - 1585
  • [8] On Top-k Queries over Evidential Data
    Bousnina, Fatma Ezzahra
    Chebbah, Mouna
    Tobji, Mohamed Anis Bach
    Hadjali, Allel
    Ben Yaghlane, Boutheina
    ICEIS: PROCEEDINGS OF THE 19TH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS - VOL 1, 2017, : 106 - 113
  • [9] Approximate distributed top-k queries
    Boaz Patt-Shamir
    Allon Shafrir
    Distributed Computing, 2008, 21 : 1 - 22
  • [10] Approximate distributed top-k queries
    Patt-Shamir, Boaz
    Shafrir, Allon
    DISTRIBUTED COMPUTING, 2008, 21 (01) : 1 - 22