Applying File Information to Block-Level Content Addressable Storage

Cited by: 1
Authors
Zhang, Youhui [1]
Wang, Dongsheng [1]
Affiliations
[1] Tsinghua National Laboratory for Information Science and Technology (TNList), Research Institute of Information Technology, Tsinghua University, Beijing
Funding
National Natural Science Foundation of China
Keywords
content-addressable storage; smart disk; virtual machine; virtual storage
DOI
10.1016/S1007-0214(09)70005-7
Abstract
Content addressable storage (CAS) is a promising technology for improving storage efficiency as well as access throughput. Currently, many CAS products are implemented on the block level, which results in loss of file information. Thus, some sophisticated optimizations cannot be achieved, such as accurate file-prefetching. This paper presents a file-aware block-level storage system combined with the CAS function. In contrast with some existing file-level CAS, this system is transparent to upper-level applications, including the operating system and the file system. These features are achieved by using smart-disk technologies to help the storage system to learn the file-system layout. A prototype was implemented on an open-source virtual machine (VM) with the guest operating system being Windows XP. Tests show that this combination significantly reduces the size of the VM image file and improves the storage performance by discarding unused blocks and using a simple file-level prefetching strategy. © 2009 Tsinghua University Press.
Pages: 41-49
Number of pages: 8