Fast Software Rejuvenation of Virtual Machine Monitors

被引:43
作者
Kourai, Kenichi [1 ]
Chiba, Shigeru [2 ]
机构
[1] Kyushu Inst Technol, Dept Creat Informat, Iizuka, Fukuoka 8208502, Japan
[2] Tokyo Inst Technol, Dept Math & Comp Sci, Meguro Ku, Tokyo 1528552, Japan
关键词
Operating systems; checkpoint/restart; main memory; availability; performance;
D O I
10.1109/TDSC.2010.20
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
As server consolidation using virtual machines (VMs) is carried out, software aging of virtual machine monitors (VMMs) is becoming critical. Since a VMM is fundamental software for running VMs, its performance degradation or crash failure affects all VMs running on top of it. To counteract such software aging, a proactive technique called software rejuvenation has been proposed. A simple example of rejuvenation is to reboot a VMM. However, simply rebooting a VMM is undesirable because that needs rebooting operating systems on all VMs. In this paper, we propose a new technique for fast rejuvenation of VMMs called the warm-VM reboot. The warm-VM reboot enables efficiently rebooting only a VMM by suspending and resuming VMs without saving the memory images to persistent storage. To achieve this, we have developed two mechanisms: on-memory suspend/resume of VMs and quick reload of a VMM. Compared with a normal reboot, the warm-VM reboot reduced the downtime by 74 percent at maximum. It also prevented the performance degradation due to cache misses after the reboot, which was 52 percent in case of a normal reboot. In a cluster environment, the warm-VM reboot achieved higher total throughput than the system using VM migration and a normal reboot.
引用
收藏
页码:839 / 851
页数:13
相关论文
共 34 条
[21]  
HUANG YN, 1995, DIG PAP INT SYMP FAU, P381, DOI 10.1109/FTCS.1995.466961
[22]  
*INT CORP, 2005, INT VIRT TECHN SPEC
[23]  
*JBOSS GROUP, JBOSS APPL SERV
[24]   A fast rejuvenation technique for server consolidation with virtual machines [J].
Kourai, Kenichi ;
Chiba, Shigeru .
37TH ANNUAL IEEE/IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS, PROCEEDINGS, 2007, :245-+
[25]  
Li L, 2002, 2002 INTERNATIONAL SYMPOSIUM ON EMPIRICAL SOFTWARE ENGINEERING, PROCEEDINGS, P91, DOI 10.1109/ISESE.2002.1166929
[26]  
Pfiffer Andy., 2003, Reducing system reboot time with kexec
[27]  
Randell B., 1975, IEEE Transactions on Software Engineering, VSE-1, P220, DOI 10.1109/TSE.1975.6312842
[28]  
Swift M. M., 2003, Operating Systems Review, V37, P207, DOI 10.1145/1165389.945466
[29]  
The Apache Software Foundation, AP HTTP SERV PROJ
[30]  
Vaidyanathan K., 1999, Proceedings 10th International Symposium on Software Reliability Engineering (Cat. No.PR00443), P84, DOI 10.1109/ISSRE.1999.809313