Prefetching-based metadata management in Advanced Multitenant Hadoop

Authors
Minh Chau Nguyen
Heesun Won
Siwoon Son
Myeong-Seon Gil
Yang-Sae Moon
Affiliations
[1] ETRI, BigData Intelligence Research Department
[2] Kangwon National University, Department of Computer Science
Source
The Journal of Supercomputing | 2019 / Vol. 75
Keywords
Big data; Hadoop; Metadata management; Multitenancy; Prefetching
DOI
Not available
Abstract
Metadata management is an essential part of Apache Hadoop. Optimizing metadata accesses improves the storing, processing, and analysis of big data, especially in multitenant environments. However, as environments grow more complex, metadata management becomes more challenging and costly because of the performance overhead it incurs. In this paper, we propose a novel prefetching-based approach to improve the performance of metadata management for Hadoop in multitenant environments. We build metadata access graphs from historical access values, define access patterns, and then prefetch items likely to be needed by near-future requests to minimize latency. We present a formal algorithm for applying the prefetching mechanism to Hadoop and implement it on a recent Hadoop system. Experimental results show that the proposed approach achieves high performance for metadata management while maintaining advanced multitenancy features.
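The general idea the abstract describes (record historical accesses in a graph, then prefetch the likely next items) can be illustrated with a minimal Python sketch. This is not the authors' algorithm or any Hadoop API: the class names `AccessGraph` and `PrefetchingCache`, the `backend` callable, and the metadata keys are all hypothetical, and the "access pattern" is assumed to be simply "the item most often requested right after the current one".

```python
from collections import Counter, defaultdict


class AccessGraph:
    """Directed graph where edge a -> b counts how often b was accessed right after a."""

    def __init__(self):
        self.edges = defaultdict(Counter)
        self.last = None  # most recently recorded key

    def record(self, key):
        # Add one unit of weight to the edge (previous key -> current key).
        if self.last is not None:
            self.edges[self.last][key] += 1
        self.last = key

    def predict(self, key, k=1):
        # Return up to k most frequent successors of key (empty list if unseen).
        successors = self.edges.get(key, Counter())
        return [succ for succ, _ in successors.most_common(k)]


class PrefetchingCache:
    """Cache that, on each request, also loads the predicted next items.

    Assumption: `backend` is any callable mapping a key to its metadata,
    standing in for a slow metadata store. There is no eviction here.
    """

    def __init__(self, backend, graph, k=1):
        self.backend = backend
        self.graph = graph
        self.k = k
        self.cache = {}
        self.hits = 0
        self.misses = 0

    def get(self, key):
        if key in self.cache:
            self.hits += 1
        else:
            self.misses += 1
            self.cache[key] = self.backend(key)
        value = self.cache[key]

        # Update the history, then prefetch likely successors not yet cached.
        self.graph.record(key)
        for nxt in self.graph.predict(key, self.k):
            if nxt not in self.cache:
                self.cache[nxt] = self.backend(nxt)
        return value


# Hypothetical access history: a database lookup is usually followed by a table lookup.
graph = AccessGraph()
for k in ["db.sales", "tbl.orders", "db.sales", "tbl.orders", "db.hr", "tbl.staff"]:
    graph.record(k)

cache = PrefetchingCache(backend=lambda key: {"name": key}, graph=graph)
cache.get("db.sales")    # miss; "tbl.orders" is prefetched as the likely successor
cache.get("tbl.orders")  # served from the cache thanks to the prefetch
```

In this sketch the second request avoids a trip to the backend because the graph learned from history that `tbl.orders` tends to follow `db.sales`. A real system would also need cache eviction and per-tenant isolation, which are left out here.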
Pages: 533-553
Number of pages: 20