Learning-Based Sample Tuning for Approximate Query Processing in Interactive Data Exploration

Cited by: 0
Authors
Zhang, Hanbing [1]
Jing, Yinan [1]
He, Zhenying [1]
Zhang, Kai [1]
Wang, X. Sean [1]
Affiliations
[1] Fudan Univ, Sch Comp Sci, Shanghai 200437, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Measurement; Adaptation models; Costs; Tuners; Accuracy; Q-learning; Query processing; Optimization; Synthetic data; Approximate query processing; interactive data exploration; data analysis;
DOI
10.1109/TKDE.2023.3341451
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104; 0812; 0835; 1405;
Abstract
For interactive data exploration, approximate query processing (AQP) is a useful approach that typically uses samples to answer queries quickly at the cost of some accuracy. Existing AQP systems often materialize samples in memory and reuse them to speed up query processing. How to tune these samples according to the workload is one of the key problems in AQP. However, because the data exploration workload is too complex to predict accurately, existing sample tuning approaches do not adapt well to changing workloads. To address this problem, this paper proposes a deep reinforcement learning-based sample tuner, RL-STuner. When tuning samples, RL-STuner considers workload changes from a global perspective and uses a Deep Q-learning Network (DQN) model to select the sample set with the maximum utility for the current workload. In addition, the paper proposes a set of optimization mechanisms to reduce the sample tuning cost. Experimental results on both real-world and synthetic datasets show that RL-STuner outperforms existing sample tuning approaches, achieving 1.6x-5.2x improvements in query accuracy at a low tuning cost.
Pages: 6532-6546
Page count: 15