About | Contact Us | Register | Login
ProceedingsSeriesJournalsSearchEAI
Collaborative Computing: Networking, Applications and Worksharing. 18th EAI International Conference, CollaborateCom 2022, Hangzhou, China, October 15-16, 2022, Proceedings, Part II

Research Article

Deep Reinforcement Learning for Multi-UAV Exploration Under Energy Constraints

Cite
BibTeX Plain Text
  • @INPROCEEDINGS{10.1007/978-3-031-24386-8_20,
        author={Yating Zhou and Dianxi Shi and Huanhuan Yang and Haomeng Hu and Shaowu Yang and Yongjun Zhang},
        title={Deep Reinforcement Learning for Multi-UAV Exploration Under Energy Constraints},
        proceedings={Collaborative Computing: Networking, Applications and Worksharing. 18th EAI International Conference, CollaborateCom 2022, Hangzhou, China, October 15-16, 2022, Proceedings, Part II},
        proceedings_a={COLLABORATECOM PART 2},
        year={2023},
        month={1},
        keywords={Multi-UAV exploration Deep reinforcement learning Energy constraints},
        doi={10.1007/978-3-031-24386-8_20}
    }
    
  • Yating Zhou
    Dianxi Shi
    Huanhuan Yang
    Haomeng Hu
    Shaowu Yang
    Yongjun Zhang
    Year: 2023
    Deep Reinforcement Learning for Multi-UAV Exploration Under Energy Constraints
    COLLABORATECOM PART 2
    Springer
    DOI: 10.1007/978-3-031-24386-8_20
Yating Zhou1, Dianxi Shi2,*, Huanhuan Yang1, Haomeng Hu1, Shaowu Yang1, Yongjun Zhang2
  • 1: College of Computer, National University of Defense Technology, Changsha
  • 2: Artificial Intelligence Research Center (AIRC), National Innovation Institute of Defense Technology (NIIDT)
*Contact email: dxshi@nudt.edu.cn

Abstract

Autonomous exploration is the essential task for various applications of unmanned aerial vehicles (UAVs), but there is currently a lack of available energy-constrained multi-UAV exploration methods. In this paper, we propose the RTN-Explorer, an environment exploration strategy that satisfies the energy constraints. The goal of environment exploration is to expand the scope of exploration as much as possible, while the goal of energy constraints is to make the UAV return to the landing zone before the energy is exhausted, so they are a pair of contradictory goals. To better balance these two goals, we use map centering, and local-global map processing methods to improve the system performance and use the minimum distance penalty function to make the multi-UAV system satisfy the energy constraints. We also use the map generator to generate different environment maps to improve generalization performance. A large number of simulation experiments verify the effectiveness and robustness of our method and show superior performance in benchmark comparison.

Keywords
Multi-UAV exploration Deep reinforcement learning Energy constraints
Published
2023-01-25
Appears in
SpringerLink
http://dx.doi.org/10.1007/978-3-031-24386-8_20
Copyright © 2022–2025 ICST
EBSCOProQuestDBLPDOAJPortico
EAI Logo

About EAI

  • Who We Are
  • Leadership
  • Research Areas
  • Partners
  • Media Center

Community

  • Membership
  • Conference
  • Recognition
  • Sponsor Us

Publish with EAI

  • Publishing
  • Journals
  • Proceedings
  • Books
  • EUDL