
Research Article
Air-Ground Edge Task Offloading Based on Multi-UAV Path Optimization and Resource Allocation
@INPROCEEDINGS{10.1007/978-3-031-86203-8_16, author={Chenguang He and Jing Li and Shouming Wei and Mengrui Guo}, title={Air-Ground Edge Task Offloading Based on Multi-UAV Path Optimization and Resource Allocation}, proceedings={Wireless and Satellite Systems. 14th EAI International Conference, WiSATS 2024, Harbin, China, August 23--25, 2024, Proceedings, Part II}, proceedings_a={WISATS PART 2}, year={2025}, month={3}, keywords={Edge Computing Deep Reinforcement Learning Task Offloading}, doi={10.1007/978-3-031-86203-8_16} }
- Chenguang He
Jing Li
Shouming Wei
Mengrui Guo
Year: 2025
Air-Ground Edge Task Offloading Based on Multi-UAV Path Optimization and Resource Allocation
WISATS PART 2
Springer
DOI: 10.1007/978-3-031-86203-8_16
Abstract
In recent years, space-air-ground integrated network(SAGIN) has been recognized as a promising area in 6G research. In the air-ground component of SAGIN, airborne device such as unmanned aerial vehicles (UAVs) can provide task offloading and computing services to ground devices. Edge servers are installed on UAVs to provide data offloading services for ground devices. The high mobility of UAVs, compared to base stations and edge computing devices on the ground, makes it better able to provide timely and effective services to the devices. Considering the needs of delay-sensitive devices, this paper jointly allocates computational resources and designs UAV trajectories to achieve the goal of minimizing delay. In this paper, we use non-orthogonal multiple access (NOMA) technique in the uplink channel, which allows a UAV to serve multiple ground devices simultaneously. Both the UAV and the ground devices are moving, and the ground devices need to re-establish their connection to the UAV every once in a while. Based on the traditional Deep Reinforcement Learning (DRL) algorithm, this study proposes the Multi-Agent DRL (MADRL) algorithm to jointly determine the optimal 3D trajectory and computational resource allocation of UAVs.The MADRL algorithm achieves complete ground cooperation among multiple UAVs as agents in optimizing the latency by co-training the neural network, simplifying the network structure, and improving the training efficiency. Numerical results show that the proposed MADRL algorithm can converge under the system quality of service (QoS) constraints, and the convergence speed is faster than that of the traditional deep Q network (DQN) algorithm. The average total delay of the system can also be effectively reduced and converged in a multi-UAV scenario.