Quality, Reliability, Security and Robustness in Heterogeneous Systems. 15th EAI International Conference, QShine 2019, Shenzhen, China, November 22–23, 2019, Proceedings

Research Article

Scheduling Virtual Machine Migration During Datacenter Upgrades with Reinforcement Learning

Download
73 downloads
  • @INPROCEEDINGS{10.1007/978-3-030-38819-5_7,
        author={Chen Ying and Baochun Li and Xiaodi Ke and Lei Guo},
        title={Scheduling Virtual Machine Migration During Datacenter Upgrades with Reinforcement Learning},
        proceedings={Quality, Reliability, Security and Robustness in Heterogeneous Systems. 15th EAI International Conference, QShine 2019, Shenzhen, China, November 22--23, 2019, Proceedings},
        proceedings_a={QSHINE},
        year={2020},
        month={1},
        keywords={Reinforcement learning Virtual machine migration},
        doi={10.1007/978-3-030-38819-5_7}
    }
    
  • Chen Ying
    Baochun Li
    Xiaodi Ke
    Lei Guo
    Year: 2020
    Scheduling Virtual Machine Migration During Datacenter Upgrades with Reinforcement Learning
    QSHINE
    Springer
    DOI: 10.1007/978-3-030-38819-5_7
Chen Ying1,*, Baochun Li1,*, Xiaodi Ke2,*, Lei Guo2,*
  • 1: University of Toronto
  • 2: Huawei Canada
*Contact email: chenying@ece.toronto.edu, bli@ece.toronto.edu, xiaodi.ke@huawei.com, leiguo@huawei.com

Abstract

Physical machines in modern datacenters are routinely upgraded due to their maintenance requirements, which involves migrating all the virtual machines they currently host to alternative physical machines. For this kind of datacenter upgrades, it is critical to minimize the time it takes to upgrade all the physical machines in the datacenter, so as to reduce disruptions to cloud services. To minimize the upgrade time, it is essential to carefully schedule the migration of virtual machines on each physical machine during its upgrade, without violating any constraints imposed by virtual machines that are currently running. Rather than resorting to heuristic algorithms, we propose a new scheduler, , that uses an experience-driven approach with deep reinforcement learning to schedule the virtual machine migration process. With our design of the state space, action space and reward function, trains a fully-connected neural network using the cross-entropy method to approximate the policy of a choosing destination physical machine for each migrating virtual machine. We compare with state-of-the-art heuristic algorithms in the literature, and our results show that effectively leads to shorter time to complete the datacenter upgrade process.