A Distributed and Cooperative NameNode Cluster for a Highly-Available Hadoop Distributed File System

Cited by: 6
Authors
Kim, Yonghwan [1]
Araragi, Tadashi [2]
Nakamura, Junya [3]
Masuzawa, Toshimitsu [1]
Affiliations
[1] Osaka Univ, Grad Sch Informat Sci & Technol, Suita, Osaka 5650871, Japan
[2] Proassist Ltd, Osaka 5410043, Japan
[3] Toyohashi Univ Technol, Informat & Media Ctr, Toyohashi, Aichi 4418122, Japan
Keywords
Hadoop; HDFS; high-availability; distributed NameNodes; automatic failover; load balancing
DOI
10.1587/transinf.2014EDP7258
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Hadoop has recently attracted much attention from engineers and researchers as an emerging and effective framework for Big Data. HDFS (Hadoop Distributed File System) can manage huge amounts of data with high performance and reliability using only commodity hardware. However, HDFS requires a single master node, called a NameNode, to manage the entire namespace (or all the i-nodes) of a file system. This creates a SPOF (Single Point Of Failure), because the file system becomes inaccessible when the NameNode fails. It also creates a performance bottleneck, since every access request to the file system has to contact the NameNode. Hadoop 2.0 resolves the SPOF problem by introducing manual failover based on two NameNodes, Active and Standby. However, the performance bottleneck remains, since in ordinary executions all access requests still have to contact the Active NameNode. Hadoop 2.0 may also lose the advantage of using commodity hardware, since the two NameNodes have to share highly reliable, sophisticated storage. In this paper, we propose a new HDFS architecture to resolve all the problems mentioned above.
Pages: 835-851
Number of pages: 17