Abstract
Learning to communicate efficiently is central to multi-agent reinforcement learning (MARL). Existing methods often require agents to exchange messages intensively, which abuses communication channels and leads to high communication overhead. Only a few methods target on learning sparse communication, but they allow limited information to be shared, which affects the efficiency of policy learning. In this work, we propose model-based communication (MBC), a learning framework with a decentralized communication scheduling process. The MBC framework enables multiple agents to make decisions with sparse communication. In particular, the MBC framework introduces a model-based message estimator to estimate the up-to-date global messages using past local data. A decentralized message scheduling mechanism is also proposed to determine whether a message shall be sent based on the estimation. We evaluated our method in a variety of mixed cooperative-competitive environments. The experiment results show that the MBC method shows better performance and lower channel overhead than the state-of-art baselines.
Original language | English |
---|---|
Title of host publication | Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems |
Publisher | International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS) |
Pages | 439–447 |
Number of pages | 9 |
Volume | 2023-May |
ISBN (Print) | 9781450394321 |
DOIs | |
Publication status | Published - 2023 |
Publication series
Name | Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS |
---|---|
ISSN (Print) | 1548-8403 |
Bibliographical note
Funding Information:We sincerely thank the anonymous reviewers. This work is partly funded by the China Scholarship Council (CSC).
Publisher Copyright:
© 2023 International Foundation for Autonomous Agents and Multiagent Systems (www.ifaamas.org). All rights reserved.
Keywords
- Communication Learning
- Message Scheduling
- Multi-Agent Reinforcement Learning
- Multi-Agent System