Prefetching-based metadata management in Advanced Multitenant Hadoop

被引:0
|
作者
Minh Chau Nguyen
Heesun Won
Siwoon Son
Myeong-Seon Gil
Yang-Sae Moon
机构
[1] ETRI,BigData Intelligence Research Department
[2] Kangwon National University,Department of Computer Science
来源
The Journal of Supercomputing | 2019年 / 75卷
关键词
Big data; Hadoop; Metadata management; Multitenancy; Prefetching;
D O I
暂无
中图分类号
学科分类号
摘要
Metadata management is an essential part in Apache Hadoop. Performing optimization of metadata accesses enhances big data storing, processing and analyzing, especially in multitenant environments. Nevertheless, as environmental complexity increases, metadata management is becoming more challenging and costly because of the heavy performance issues. In this paper, we propose a novel approach to improve the performance of metadata management for Hadoop in the multitenant environment based on the prefetching mechanism. We create metadata access graphs based on historical access values, define access patterns and then perform prefetching potential items for the near-future requests to minimize the latency. We present a formal algorithm to apply the prefetching mechanism into the Hadoop system and perform the actual implementation on a recent Hadoop system. Experimental results show that the proposed approach can enable the high performance for metadata management as well as maintain advanced multitenancy features.
引用
收藏
页码:533 / 553
页数:20
相关论文
共 50 条
  • [1] Prefetching-based metadata management in Advanced Multitenant Hadoop
    Minh Chau Nguyen
    Won, Heesun
    Son, Siwoon
    Gil, Myeong-Seon
    Moon, Yang-Sae
    JOURNAL OF SUPERCOMPUTING, 2019, 75 (02) : 533 - 553
  • [2] Advanced Resource Management with Access Control for Multitenant Hadoop
    Won, Heesun
    Nguyen, Minh Chau
    Gil, Myeong-Seon
    Moon, Yang-Sae
    JOURNAL OF COMMUNICATIONS AND NETWORKS, 2015, 17 (06) : 592 - 601
  • [3] Advanced Multitenant Hadoop in Smart Open Data Platform
    Minh Chau Nguyen
    Won, Hee Sun
    INTERNATIONAL CONFERENCE ON BIG DATA AND INTERNET OF THINGS (BDIOT 2017), 2017, : 48 - 51
  • [4] A Prefetching-based Replication Algorithm in Data Grid
    Tian, Tian
    Luo, Junzhou
    Wu, Zhiang
    Song, Aibo
    2008 3RD INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND APPLICATIONS, VOLS 1 AND 2, 2008, : 528 - 533
  • [5] Data Prefetching for Scientific Workflow Based on Hadoop
    Chen, Gaozhao
    Wu, Shaochun
    Gu, Rongrong
    Xu, Yongquan
    Xu, Lingyu
    Ge, Yunwen
    Song, Cuicui
    COMPUTER AND INFORMATION SCIENCE 2012, 2012, 429 : 81 - 92
  • [6] Prefetching-Based Content Download for Highway Vehicular Ad Hoc Networks
    Guo, Tao
    Li, Changle
    Miao, Zhifang
    Dong, Weiwei
    Su, Xiaonan
    2017 IEEE/CIC INTERNATIONAL CONFERENCE ON COMMUNICATIONS IN CHINA (ICCC), 2017, : 1139 - 1144
  • [7] Dr.Hadoop:an infinite scalable metadata management for Hadoop—How the baby elephant becomes immortal
    Dipayan DEV
    Ripon PATGIRI
    FrontiersofInformationTechnology&ElectronicEngineering, 2016, 17 (01) : 15 - 31
  • [8] Dr. Hadoop: an infinite scalable metadata management for Hadoop—How the baby elephant becomes immortal
    Dipayan Dev
    Ripon Patgiri
    Frontiers of Information Technology & Electronic Engineering, 2016, 17 : 15 - 31
  • [9] Classification based Metadata Management for HDFS
    Chandrasekar, Ashok
    Chandrasekar, Karthik
    Ramasatagopan, Harini
    Rafica, A. R.
    Balasubramaniyan, Jagadeesh
    2012 IEEE 14TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS & 2012 IEEE 9TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (HPCC-ICESS), 2012, : 1021 - 1026
  • [10] Dr. Hadoop: an infinite scalable metadata management for Hadoop-How the baby elephant becomes immortal
    Dev, Dipayan
    Patgiri, Ripon
    FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2016, 17 (01) : 15 - 31