About | Contact Us | Register | Login
ProceedingsSeriesJournalsSearchEAI
Mobile and Ubiquitous Systems: Computing, Networking and Services. 20th EAI International Conference, MobiQuitous 2023, Melbourne, VIC, Australia, November 14–17, 2023, Proceedings, Part II

Research Article

Towards Distributed Control Under Deficient Communication with Multi-agent Reinforcement Learning

Cite
BibTeX Plain Text
  • @INPROCEEDINGS{10.1007/978-3-031-63992-0_33,
        author={Arne Kummerow and Torben Weis},
        title={Towards Distributed Control Under Deficient Communication with Multi-agent Reinforcement Learning},
        proceedings={Mobile and Ubiquitous Systems: Computing, Networking and Services. 20th EAI International Conference, MobiQuitous 2023, Melbourne, VIC, Australia, November 14--17, 2023, Proceedings, Part II},
        proceedings_a={MOBIQUITOUS PART 2},
        year={2024},
        month={7},
        keywords={MARL Dec-MDP Dec-POMDP Communication Failure},
        doi={10.1007/978-3-031-63992-0_33}
    }
    
  • Arne Kummerow
    Torben Weis
    Year: 2024
    Towards Distributed Control Under Deficient Communication with Multi-agent Reinforcement Learning
    MOBIQUITOUS PART 2
    Springer
    DOI: 10.1007/978-3-031-63992-0_33
Arne Kummerow1,*, Torben Weis1
  • 1: University of Duisburg-Essen, Bismarckstraße 90
*Contact email: arne.kummerow@uni-due.de

Abstract

Multi-agent reinforcement learning solves optimization problems in sequential decision-making and enables controlling spatially distributed actuators. We consider a cooperative setting where agents can exchange information via a central controller, e.g. a cloud-based service. In real world applications however, communication channels are often error-prone and agents may become disconnected and can neither send its observation nor receive observations from other agents. We formalize this problem as a subclass of decentralized Markov decision processes and discuss the complexity of the problem. We then propose several solution concepts that involve breaking down the complexity by considering only a subset of failure scenarios, learning independent policies for each failure scenario, reconstructing missing information and learning policies that incorporate the state uncertainty in the training process.

Keywords
MARL Dec-MDP Dec-POMDP Communication Failure
Published
2024-07-19
Appears in
SpringerLink
http://dx.doi.org/10.1007/978-3-031-63992-0_33
Copyright © 2023–2025 ICST
EBSCOProQuestDBLPDOAJPortico
EAI Logo

About EAI

  • Who We Are
  • Leadership
  • Research Areas
  • Partners
  • Media Center

Community

  • Membership
  • Conference
  • Recognition
  • Sponsor Us

Publish with EAI

  • Publishing
  • Journals
  • Proceedings
  • Books
  • EUDL