A Metadata Update Strategy for Large Directories in Wide-area File Systems

Cited by: 0
Authors
Liu, Guoliang [1, 2]
Liu, Zhenjun [1]
Ma, Liuying [1, 2]
Zhang, Shuai [1, 2]
Huang, Jing [1, 2]
Bao, Xiuguo [3]
Affiliations
[1] Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
[3] Natl Comp Network, Emergency Response Tech Team, Coordinat Ctr China, Beijing, Peoples R China
Source
2014 IEEE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2014 IEEE 6TH INTL SYMP ON CYBERSPACE SAFETY AND SECURITY, 2014 IEEE 11TH INTL CONF ON EMBEDDED SOFTWARE AND SYST (HPCC,CSS,ICESS) | 2014
Keywords
large directory; metadata; deduplication; caching; FUSE
DOI
10.1109/HPCC.2014.67
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In the era of Big Data, a single directory can contain tens of thousands or even millions of files. Directory update methods employed by traditional POSIX-compliant file systems do not adapt well to wide-area networks and can consume a large amount of bandwidth unnecessarily because they invalidate the entire cache. In traditional file systems, if a small fraction of a directory changes (e.g., a file is renamed), the whole cache of the directory metadata must be discarded and a new copy obtained from the server, resulting in poor performance in low-bandwidth environments. In this paper, we propose a directory metadata update strategy that partitions metadata into blocks and transfers only the modified block(s) over the network to reduce transmission time. We implement a proof-of-concept prototype using the FUSE userspace filesystem to verify the effectiveness of our approach. Results show that, for a directory whose entries total 5 MB, the update time for small changes in the directory can be reduced by a factor of roughly 20.
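The block-level update idea in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not say how blocks are formed or how modified blocks are detected. The sketch assumes entries are assigned to a fixed number of blocks by a hash of the file name, and that changed blocks are found by comparing SHA-1 digests. `NUM_BLOCKS`, `block_index`, `partition`, `digests`, and `changed_blocks` are all hypothetical names, and each entry is reduced to a name and an inode number.

```python
import hashlib
import zlib

NUM_BLOCKS = 64  # hypothetical block count; the abstract does not specify one


def block_index(name: str) -> int:
    """Map a file name to a block with a stable hash (an assumption of this sketch)."""
    return zlib.crc32(name.encode("utf-8")) % NUM_BLOCKS


def partition(entries: dict) -> list:
    """Split a directory (name -> inode number) into NUM_BLOCKS serialized blocks."""
    buckets = [[] for _ in range(NUM_BLOCKS)]
    for name, inode in entries.items():
        buckets[block_index(name)].append((name, inode))
    # Sort each bucket so identical contents always serialize identically.
    return [
        "\n".join(f"{n}\t{i}" for n, i in sorted(bucket)).encode("utf-8")
        for bucket in buckets
    ]


def digests(blocks: list) -> list:
    """One SHA-1 digest per block; the client sends these instead of the blocks."""
    return [hashlib.sha1(b).hexdigest() for b in blocks]


def changed_blocks(client_digests: list, server_blocks: list) -> dict:
    """Server side: return only the blocks whose digest differs from the client's."""
    server_digests = digests(server_blocks)
    return {
        i: server_blocks[i]
        for i, (c, s) in enumerate(zip(client_digests, server_digests))
        if c != s
    }


# Usage: a 10,000-entry directory in which one file is renamed.
entries = {f"file{i:05d}": i for i in range(10000)}
client_cache = partition(entries)

renamed = dict(entries)
renamed["renamed.txt"] = renamed.pop("file00042")
server_blocks = partition(renamed)

delta = changed_blocks(digests(client_cache), server_blocks)

# Patch the client cache with only the modified blocks.
patched = list(client_cache)
for i, block in delta.items():
    patched[i] = block
```

Under this hashing assumption, a rename touches at most two blocks (the one holding the old name and the one holding the new name), so only those blocks cross the network rather than the full directory listing.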
Pages: 400-403
Page count: 4