Semantics-Aware Prediction for Analytic Queries in MapReduce Environment

被引:6
|
作者
Yu, Weikuan [1 ]
Liu, Zhuo [2 ]
Ding, Xiaoning [3 ]
机构
[1] Florida State Univ, Tallahassee, FL 32306 USA
[2] Auburn Univ, Auburn, AL 36849 USA
[3] New Jersey Inst Technol, Newark, NJ 07102 USA
来源
47TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP '18) | 2018年
基金
美国国家科学基金会;
关键词
MapReduce; Semantics-Aware; Analytics Query; Scheduling;
D O I
10.1145/3229710.3229713
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
MapReduce has emerged as a powerful data processing engine that supports large-scale complex analytics applications, most of which are written in declarative query languages such as HiveQL and Pig Latin. Analytic queries are typically compiled into execution plans in the form of directed acyclic graphs (DAGs) of MapReduce jobs. Jobs in the DAGs are dispatched to the MapReduce processing engine as soon as their dependencies are satisfied. MapReduce adopts a job-level scheduling policy to strive for balanced distribution of tasks and effective utilization of resources. However, there is a lack of query-level semantics in the purely task-based scheduling algorithms, resulting in resource thrashing among queries and an overall degradation of performance. Therefore, we introduce a semantic-aware query prediction framework to address these problems systematically. Our framework includes three major techniques: cross-layer semantics percolation, selectivity estimation, and multivariate time prediction for analytic queries. Multivariate query prediction allows us not only to gauge the dynamic size of analytics datasets, but also to accurately predict the resource usage (e.g., numbers of map and reduce tasks) of individual MapReduce jobs and whole queries. In addition, the accurate prediction and queuing of queries can be potentially exploited by Hadoop scheduling for optimizing overall query performance. Based on the query prediction, our case study scheduler demonstrates significant performance improvement compared to traditional Hadoop schedulers.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Semantics-aware services for the mobile computing environment
    Georgantas, N
    Ben Mokhtar, S
    Tartanoglu, F
    Issarny, V
    ARCHITECTING DEPENDABLE SYSTEMS III, 2005, 3549 : 1 - 35
  • [2] Semantics-aware obfuscation scheme prediction for binary
    Zhao, Yujie
    Tang, Zhanyong
    Ye, Guixin
    Peng, Dongxu
    Fang, Dingyi
    Chen, Xiaojiang
    Wang, Zheng
    COMPUTERS & SECURITY, 2020, 99
  • [3] Semantics-Aware Autoencoder
    Bellini, Vito
    Di Noia, Tommaso
    Di Sciascio, Eugenio
    Schiavone, Angelo
    IEEE ACCESS, 2019, 7 : 166122 - 166137
  • [4] Attentive Review Semantics-Aware Recommendation Model for Rating Prediction
    Kim, Jihyeon
    Li, Xinzhe
    Jin, Li
    Li, Qinglong
    Kim, Jaekyeong
    ELECTRONICS, 2024, 13 (14)
  • [5] Semantics-aware malware detection
    Christodorescu, M
    Jha, S
    Seshia, SA
    Song, D
    Bryant, RE
    2005 IEEE SYMPOSIUM ON SECURITY AND PRIVACY, PROCEEDINGS, 2005, : 32 - 46
  • [6] Semantics-Aware Trace Analysis
    Hoffman, Kevin
    Eugster, Patrick
    Jagannathan, Suresh
    PLDI'09 PROCEEDINGS OF THE 2009 ACM SIGPLAN CONFERENCE ON PROGRAMMING LANGUAGE DESIGN AND IMPLEMENTATION, 2009, : 453 - 464
  • [7] Semantics-aware perimeter protection
    Cremonini, M
    Damiani, E
    Samarati, P
    DATA AND APPLICATIONS SECURITY XVII: STATUS AND PROSPECTS, 2004, 142 : 229 - 242
  • [8] Semantics-Aware Trace Analysis
    Hoffman, Kevin
    Eugster, Patrick
    Jagannathan, Suresh
    ACM SIGPLAN NOTICES, 2009, 44 (06) : 453 - 464
  • [9] Sensitive Semantics-Aware Personality Cloaking on Road-Network Environment
    Li, Min
    Qin, Zhiguang
    Wang, Cong
    INTERNATIONAL JOURNAL OF SECURITY AND ITS APPLICATIONS, 2014, 8 (01): : 133 - 146
  • [10] Semantics-Aware Warehousing of Symbolic Trajectories
    Trajcevski, Goce
    Donevska, Ivana
    Vaisman, Alejandro
    Avci, Besim
    Zhang, Tian
    Tian, Di
    PROCEEDINGS OF THE 6TH ACM SIGSPATIAL INTERNATIONAL WORKSHOP ON GEOSTREAMING (IWGS) 2015, 2015, : 1 - 8