RDMA-based Cooperative Caching for a Distributed File System

被引:0
作者
Sasaki, Shin [1 ]
Matsumiya, Ryo [1 ]
Takahashi, Kazushi [1 ]
Oyama, Yoshihiro [1 ]
Tatebe, Osamu [2 ]
机构
[1] Univ Electrocommun, Dept Informat, Tokyo, Japan
[2] Univ Tsukuba, Dept Comp Sci, Tsukuba, Ibaraki 305, Japan
来源
2015 IEEE 21ST INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS) | 2015年
关键词
Distributed file systems; file caching; cooperative caching; InfiniBand; RDMA; data-intensive applications; high-performance computing;
D O I
10.1109/ICPADS.2015.51
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Efficient caching of file data is critical in order to achieve high performance in data-intensive applications. However, only a limited amount of memory is usually available to cache files in client nodes even on high-performance computing platforms. Cooperative caching is an approach that enables client nodes to share memory for file caching and thereby provide a large amount of memory for the file cache in the aggregate. Many studies have confirmed the efficacy of applying cooperative caching to distributed file systems. However, to the best of our knowledge, no study has evaluated an implementation of cooperative caching integrated into a modern distributed file system running on a high-speed network. In this paper, we propose a method that improves the performance of a distributed file system oriented to high-performance computing by integrating cooperative caching into it. In the proposed method, the metadata server of the distributed file system maintains information about the cache in all client nodes, and provides clients with the predicted cache location of any requested file. Further, InfiniBand RDMA is utilized to achieve fast cache transfer between the page caches of client nodes. Implementation of the proposed method in the Gfarm distributed file system and measurement of the performance of three real-world data-intensive applications indicate that the proposed method achieves a maximum speedup of 5.8%.
引用
收藏
页码:344 / 353
页数:10
相关论文
共 28 条
  • [1] BASIC LOCAL ALIGNMENT SEARCH TOOL
    ALTSCHUL, SF
    GISH, W
    MILLER, W
    MYERS, EW
    LIPMAN, DJ
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) : 403 - 410
  • [2] Annapureddy S, 2005, USENIX Association Proceedings of the 2nd Symposium on Networked Systems Design & Implementation (NSDI '05), P129
  • [3] [Anonymous], P 26 IEEE S MASS STO
  • [4] [Anonymous], 2004, P 11 ECMWF WORKSH US
  • [5] [Anonymous], 2004, ASTROPHYSICAL J, V611
  • [6] NFS-CD: Write-enabled cooperative caching in NFS
    Batsakis, Alexandros
    Burns, Randal
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2008, 19 (03) : 323 - 333
  • [7] Berriman G. B., 2006, P 6 ANN NASA EARTH S
  • [8] Carns PH, 2000, USENIX ASSOCIATION PROCEEDINGS OF THE 4TH ANNUAL LINUX SHOWCASE AND CONFERENCE, ATLANTA, P317
  • [9] Dahlin M. D., 1994, P 1 USENIX C OP SYST
  • [10] Furht B, 2011, HANDBOOK OF DATA INTENSIVE COMPUTING, P1, DOI 10.1007/978-1-4614-1415-5