Quality, Reliability, Security and Robustness in Heterogeneous Systems. 15th EAI International Conference, QShine 2019, Shenzhen, China, November 22–23, 2019, Proceedings

Research Article

A Reinforcement Learning Based Placement Strategy in Datacenter Networks

Download
70 downloads
  • @INPROCEEDINGS{10.1007/978-3-030-38819-5_6,
        author={Weihong Yang and Yang Qin and ZhaoZheng Yang},
        title={A Reinforcement Learning Based Placement Strategy in Datacenter Networks},
        proceedings={Quality, Reliability, Security and Robustness in Heterogeneous Systems. 15th EAI International Conference, QShine 2019, Shenzhen, China, November 22--23, 2019, Proceedings},
        proceedings_a={QSHINE},
        year={2020},
        month={1},
        keywords={Datacenter networks Placement strategy Q-learning},
        doi={10.1007/978-3-030-38819-5_6}
    }
    
  • Weihong Yang
    Yang Qin
    ZhaoZheng Yang
    Year: 2020
    A Reinforcement Learning Based Placement Strategy in Datacenter Networks
    QSHINE
    Springer
    DOI: 10.1007/978-3-030-38819-5_6
Weihong Yang1, Yang Qin1,*, ZhaoZheng Yang1
  • 1: Harbin Institute of Technology (Shenzhen)
*Contact email: csyqin@hit.edu.cn

Abstract

As the core infrastructure of cloud computing, the datacenter networks place heavy demands on efficient storage and management of massive data. Data placement strategy, which decides how to assign data to nodes for storage, has a significant impact on the performance of the datacenter. However, most of the existing solutions cannot be better adaptive to the dynamics of the network. Moreover, they focus on where to store the data (i.e., the selection of storage node) but have not considered how to store them (i.e., the selection of routing path). Since reinforcement learning (RL) has been developed as a promising solution to address dynamic network issues, in this paper, we integrate RL into the datacenter networks to deal with the data placement issue. Considering the dynamics of resources, we propose a Q-learning based data placement strategy for datacenter networks. By leveraging Q-learning, each node can adaptively select next-hop based on the network information collected from downstream, and forward the data toward the storage node that has adequate capacity along the path with high available bandwidth. We evaluate our proposal on the NS-3 simulator in terms of average delay, throughput, and load balance. Simulation results show that the Q-learning placement strategy can effectively reduce network delay and increase average throughout while achieving load-balanced among servers.