Fast Software Rejuvenation of Virtual Machine Monitors

Cited by: 42
Authors
Kourai, Kenichi [1]
Chiba, Shigeru [2]
Affiliations
[1] Kyushu Inst Technol, Dept Creat Informat, Iizuka, Fukuoka 8208502, Japan
[2] Tokyo Inst Technol, Dept Math & Comp Sci, Meguro Ku, Tokyo 1528552, Japan
Keywords
Operating systems; checkpoint/restart; main memory; availability; performance
DOI
10.1109/TDSC.2010.20
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
As server consolidation using virtual machines (VMs) is carried out, software aging of virtual machine monitors (VMMs) is becoming critical. Since a VMM is fundamental software for running VMs, its performance degradation or crash failure affects all VMs running on top of it. To counteract such software aging, a proactive technique called software rejuvenation has been proposed. A simple example of rejuvenation is to reboot a VMM. However, simply rebooting a VMM is undesirable because that needs rebooting operating systems on all VMs. In this paper, we propose a new technique for fast rejuvenation of VMMs called the warm-VM reboot. The warm-VM reboot enables efficiently rebooting only a VMM by suspending and resuming VMs without saving the memory images to persistent storage. To achieve this, we have developed two mechanisms: on-memory suspend/resume of VMs and quick reload of a VMM. Compared with a normal reboot, the warm-VM reboot reduced the downtime by 74 percent at maximum. It also prevented the performance degradation due to cache misses after the reboot, which was 52 percent in case of a normal reboot. In a cluster environment, the warm-VM reboot achieved higher total throughput than the system using VM migration and a normal reboot.
Pages: 839-851
Number of pages: 13