Prefetching-based metadata management in Advanced Multitenant Hadoop

被引:0
|
作者
Minh Chau Nguyen
Heesun Won
Siwoon Son
Myeong-Seon Gil
Yang-Sae Moon
机构
[1] ETRI,BigData Intelligence Research Department
[2] Kangwon National University,Department of Computer Science
来源
The Journal of Supercomputing | 2019年 / 75卷
关键词
Big data; Hadoop; Metadata management; Multitenancy; Prefetching;
D O I
暂无
中图分类号
学科分类号
摘要
Metadata management is an essential part in Apache Hadoop. Performing optimization of metadata accesses enhances big data storing, processing and analyzing, especially in multitenant environments. Nevertheless, as environmental complexity increases, metadata management is becoming more challenging and costly because of the heavy performance issues. In this paper, we propose a novel approach to improve the performance of metadata management for Hadoop in the multitenant environment based on the prefetching mechanism. We create metadata access graphs based on historical access values, define access patterns and then perform prefetching potential items for the near-future requests to minimize the latency. We present a formal algorithm to apply the prefetching mechanism into the Hadoop system and perform the actual implementation on a recent Hadoop system. Experimental results show that the proposed approach can enable the high performance for metadata management as well as maintain advanced multitenancy features.
引用
收藏
页码:533 / 553
页数:20
相关论文
共 50 条
  • [31] The Internet of Things based Medical Emergency Management using Hadoop Ecosystem
    Rathore, M. Mazhar
    Ahmad, Awais
    Paul, Anand
    2015 IEEE SENSORS, 2015, : 84 - 87
  • [32] Material Database Design and Realization Based on the Metadata Management of Electric Power
    Du, HaiZhou
    Zhang, DaQuan
    Wu, ChongTian
    NETWORK COMPUTING AND INFORMATION SECURITY, 2012, 345 : 626 - +
  • [33] AngleCut: A Ring-Based Hashing Scheme for Distributed Metadata Management
    Liu, Jiaxi
    Wang, Renxuan
    Gao, Xiaofeng
    Yang, Xiaochun
    Chen, Guihai
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2017), PT I, 2017, 10177 : 71 - 86
  • [34] A Dynamic Repository Approach for Small File Management With Fast Access Time on Hadoop Cluster: Hash Based Extended Hadoop Archive
    Sharma, Vijay Shankar
    Afthanorhan, Asyraf
    Barwar, Nemi Chand
    Singh, Satyendra
    Malik, Hasmat
    IEEE ACCESS, 2022, 10 : 36856 - 36867
  • [35] Design and realization of bank history data management system based on Hadoop 2.0
    Meiwen Guo
    Cluster Computing, 2019, 22 : 8445 - 8451
  • [36] Design and realization of bank history data management system based on Hadoop 2.0
    Guo, Meiwen
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (Suppl 4): : S8445 - S8451
  • [37] Pattern-Based Prefetching with Adaptive Cache Management Inside of Solid-State Drives
    Li, Jun
    Xu, Xiaofei
    Cai, Zhigang
    Liao, Jianwei
    Li, Kenli
    Gerofi, Balazs
    Ishikawa, Yutaka
    ACM TRANSACTIONS ON STORAGE, 2022, 18 (01)
  • [38] Metadata and knowledge management driven web-based learning information system
    Rego, Hugo
    Moreira, Tiago
    Garcia, Francisco
    Morales, Erla
    INTERNATIONAL JOURNAL OF TECHNOLOGY ENHANCED LEARNING, 2009, 1 (03) : 215 - 228
  • [39] Metadata and Knowledge Management Driven Web-Based Learning Information System
    Rego, Hugo
    Moreira, Tiago
    Morales, Erla
    Garcia, Francisco Jose
    OPEN KNOWLEDGE SOCIETY: A COMPUTER SCIENCE AND INFORMATION SYSTEMS MANIFESTO, 2008, 19 : 308 - 313
  • [40] A Novel Metadata Management Architecture Based on Service Separation in Cluster File System
    Zhang, Junwei
    Zhang, Jingliang
    Zhang, Jiangang
    Han, Xiaoming
    Xu, Lu
    2009 INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES (PDCAT 2009), 2009, : 110 - 115