On Distributed File Tree Walk of Parallel File Systems

被引:0
作者
LaFon, Jharrod [1 ,2 ]
Misra, Satyajayant [1 ]
Bringhurst, Jon [2 ]
机构
[1] New Mexico State Univ, Las Cruces, NM 88003 USA
[2] Los Alamos Natl Lab, Los Alamos, NM 87544 USA
来源
2012 INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SC) | 2012年
关键词
File Systems; Metadata; Parallel Algorithms; 1ST SEARCH; ALGORITHM;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Supercomputers generate vast amounts of data, typically organized into large directory hierarchies on parallel file systems. While the supercomputing applications are parallel, the tools used to process them requiring complete directory traversals, are typically serial. We present an algorithm framework and three fully distributed algorithms for traversing large parallel file systems, and performing file operations in parallel. The first algorithm introduces a randomized work-stealing scheduler; the second improves the first with proximity-awareness; and the third improves upon the second by using a hybrid approach. We have tested our implementation on Cielo, a 1.37 petaflop supercomputer at the Los Alamos National Laboratory and its 7 petabyte file system. Test results show that our algorithms execute orders of magnitude faster than state-of-the-art algorithms while achieving ideal load balancing and low communication cost. We present performance insights from the use of our algorithms in production systems at LANL, performing daily file system operations.
引用
收藏
页数:11
相关论文
共 25 条
  • [1] [Anonymous], ARXIV11044518
  • [2] [Anonymous], IEEE SCAL HIGH PERF
  • [3] [Anonymous], ACM IEEE C SUP
  • [4] [Anonymous], 2008, 100312008 IEEE
  • [5] [Anonymous], THINKING PARALLEL SO
  • [6] [Anonymous], IEEE FDN COMPUTER SC
  • [7] [Anonymous], 1991, MSCIS9171 U PENN DEP
  • [8] [Anonymous], RFC 1166 INTERNET NU
  • [9] [Anonymous], ACM IEEE C SUP IEEE
  • [10] [Anonymous], P 2005 ACM IEEE C SU, DOI 10.1109/SC.2005.4