Towards Distributed Control Under Deficient Communication with Multi-agent Reinforcement Learning

Arne Kummerow; Torben Weis

Mobile and Ubiquitous Systems: Computing, Networking and Services. 20th EAI International Conference, MobiQuitous 2023, Melbourne, VIC, Australia, November 14–17, 2023, Proceedings, Part II

Research Article

Towards Distributed Control Under Deficient Communication with Multi-agent Reinforcement Learning

Cite: BibTeX Plain Text

@INPROCEEDINGS{10.1007/978-3-031-63992-0_33,
    author={Arne Kummerow and Torben Weis},
    title={Towards Distributed Control Under Deficient Communication with Multi-agent Reinforcement Learning},
    proceedings={Mobile and Ubiquitous Systems: Computing, Networking and Services. 20th EAI International Conference, MobiQuitous 2023, Melbourne, VIC, Australia, November 14--17, 2023, Proceedings, Part II},
    proceedings_a={MOBIQUITOUS PART 2},
    year={2024},
    month={7},
    keywords={MARL Dec-MDP Dec-POMDP Communication Failure},
    doi={10.1007/978-3-031-63992-0_33}
}

Arne Kummerow
Torben Weis
Year: 2024
Towards Distributed Control Under Deficient Communication with Multi-agent Reinforcement Learning
MOBIQUITOUS PART 2
Springer
DOI: 10.1007/978-3-031-63992-0_33

Arne Kummerow¹^,*, Torben Weis¹

1: University of Duisburg-Essen, Bismarckstraße 90

*Contact email: arne.kummerow@uni-due.de

Abstract

Multi-agent reinforcement learning solves optimization problems in sequential decision-making and enables controlling spatially distributed actuators. We consider a cooperative setting where agents can exchange information via a central controller, e.g. a cloud-based service. In real world applications however, communication channels are often error-prone and agents may become disconnected and can neither send its observation nor receive observations from other agents. We formalize this problem as a subclass of decentralized Markov decision processes and discuss the complexity of the problem. We then propose several solution concepts that involve breaking down the complexity by considering only a subset of failure scenarios, learning independent policies for each failure scenario, reconstructing missing information and learning policies that incorporate the state uncertainty in the training process.

Keywords: MARL, Dec-MDP, Dec-POMDP, Communication Failure

Published: 2024-07-19
Appears in: SpringerLink

: http://dx.doi.org/10.1007/978-3-031-63992-0_33

Towards Distributed Control Under Deficient Communication with Multi-agent Reinforcement Learning

Abstract

About EAI

Community

Publish with EAI