Fast Software Rejuvenation of Virtual Machine Monitors

被引:43
作者
Kourai, Kenichi [1 ]
Chiba, Shigeru [2 ]
机构
[1] Kyushu Inst Technol, Dept Creat Informat, Iizuka, Fukuoka 8208502, Japan
[2] Tokyo Inst Technol, Dept Math & Comp Sci, Meguro Ku, Tokyo 1528552, Japan
关键词
Operating systems; checkpoint/restart; main memory; availability; performance;
D O I
10.1109/TDSC.2010.20
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
As server consolidation using virtual machines (VMs) is carried out, software aging of virtual machine monitors (VMMs) is becoming critical. Since a VMM is fundamental software for running VMs, its performance degradation or crash failure affects all VMs running on top of it. To counteract such software aging, a proactive technique called software rejuvenation has been proposed. A simple example of rejuvenation is to reboot a VMM. However, simply rebooting a VMM is undesirable because that needs rebooting operating systems on all VMs. In this paper, we propose a new technique for fast rejuvenation of VMMs called the warm-VM reboot. The warm-VM reboot enables efficiently rebooting only a VMM by suspending and resuming VMs without saving the memory images to persistent storage. To achieve this, we have developed two mechanisms: on-memory suspend/resume of VMs and quick reload of a VMM. Compared with a normal reboot, the warm-VM reboot reduced the downtime by 74 percent at maximum. It also prevented the performance degradation due to cache misses after the reboot, which was 52 percent in case of a normal reboot. In a cluster environment, the warm-VM reboot achieved higher total throughput than the system using VM migration and a normal reboot.
引用
收藏
页码:839 / 851
页数:13
相关论文
共 34 条
[1]  
Accetta M., 1986, Proceedings of the Summer 1986 USENIX Conference, P93
[2]  
*AMD, 2005, AMD64 VIRT COD PAC
[3]  
[Anonymous], CS95302 U TENN
[4]  
[Anonymous], 1998, HTTPERF TOOL MEASURI, DOI DOI 10.1145/306225.306235
[5]  
BAKER M, 1992, PROCEEDINGS OF THE SUMMER 1992 USENIX CONFERENCE, P31
[6]  
Barham P., 2003, Operating Systems Review, V37, P164, DOI 10.1145/1165389.945462
[7]  
Candea G, 2004, USENIX Association Proceedings of the Sixth Symposium on Operating Systems Design and Implementation (OSDE '04), P31
[8]   Proactive management of software aging [J].
Castelli, V ;
Harper, RE ;
Heidelberger, P ;
Hunter, SW ;
Trivedi, KS ;
Vaidyanathan, K ;
Zeggert, WP .
IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 2001, 45 (02) :311-332
[9]   The Rio file cache: Surviving operating system crashes [J].
Chen, PM ;
Ng, WT ;
Chandra, S ;
Aycock, C ;
Rajamani, G ;
Lowell, D .
ACM SIGPLAN NOTICES, 1996, 31 (09) :74-83
[10]  
Chou Andy., 2001, P 18 ACM S OPERATING, P73