
Research Article
Multi-agent Quantum Reinforcement Learning for Digital Twin Placement in 6G Multi-tier Systems
@INPROCEEDINGS{10.1007/978-3-031-67357-3_6, author={Shehbaz Tariq and Muhammad Shohibul Ulum and Abdurrahman Wachid Shaffar and Wook Park and Sunghwan Kim and Hyundong Shin}, title={Multi-agent Quantum Reinforcement Learning for Digital Twin Placement in 6G Multi-tier Systems}, proceedings={Industrial Networks and Intelligent Systems. 10th EAI International Conference, INISCOM 2024, Da Nang, Vietnam, February 20--21, 2024, Proceedings}, proceedings_a={INISCOM}, year={2024}, month={7}, keywords={Digital twin Multi-tier computing Quantum reinforcement learning Digital twin placement}, doi={10.1007/978-3-031-67357-3_6} }
- Shehbaz Tariq
Muhammad Shohibul Ulum
Abdurrahman Wachid Shaffar
Wook Park
Sunghwan Kim
Hyundong Shin
Year: 2024
Multi-agent Quantum Reinforcement Learning for Digital Twin Placement in 6G Multi-tier Systems
INISCOM
Springer
DOI: 10.1007/978-3-031-67357-3_6
Abstract
The digital twins (DTs) represent critical components for the simulation, analysis, and optimization of physical systems, with significant implications for efficiency and cost management in 6G network applications. The introduction of multi-tier computing has the potential to enable a more streamlined integration of DTs in 6G networks by offering services at the edge network level. However, this integration brings about various complexities related to the placement and maintenance of DTs within edge networks, thus increasing the processing latency. To address these problems, we investigate the application of quantum computing, which exploits quantum principles such as superposition and entanglement, and offers potential resolutions for these computational dilemmas. In this paper, we formulate the DT placement problem to incorporate variational quantum circuits for learning agents within a multi-agent reinforcement learning framework. In particular, quantum multi-agent reinforcement learning is proposed to establish an optimal policy for associating DTs with edge networks. This approach aims to reduce latency while adhering to the computational resource limitations of the edge server. Simulation results illustrate the proficiency and robustness of quantum multi-agent actor-critic networks in acquiring a policy that ameliorates the reward function, hence decreasing latency while adhering to the optimization constraints. This study contributes to the evolving field of quantum computing applications in multi-tier environments and provides methodological insights for optimizing DT deployment in 6G networks.