Questions tagged [multi-agent-systems]
For questions related to multi-agent systems (MAS), which are systems that involve multiple agents (each of them can have different skills) that cooperate with each other and interact with the environment. There are several challenges faced by MAS, including coordination between agents, security, and task allocation. Multi-agent systems have been applied in areas such as computer science, civil engineering, and electrical engineering.
53 questions
0 votes
0 answers
14 views
Rethinking My Deep-Research Agent Workflow — Should We Move Beyond Static Trees?
I’m reevaluating a deep-research workflow I built earlier and would love some advice. My previous design used a static tree workflow (fixed width/depth, node = search → extract → summarize → generate ...
0 votes
0 answers
27 views
Irreducibility of an island model with periodic migration
Let $K$ be the one-step kernel of a single AlphaEvolve agent acting on the set $\mathcal S := \mathcal P_{\mathrm{good}}$ of programs that pass evaluation, and assume $K$ is irreducible. Run $k \ge 2$ ...
0 votes
1 answer
80 views
Creating a Dynamic Heat Map for Multi-Agent Path Planning
I’m working on a system where agents must move scattered objects (aggregates) from various locations (cells) to a target zone. The environment is divided into a grid of cells, where each cell contains ...
2 votes
1 answer
687 views
Q learning (DQN) strategy for a multiplayer zero-sum game
I have been looking for ways to train a Q-learning agent for a multiplayer zero-sum game (a variation of Tic-Tac-Toe in my case). I came up with a learning strategy I haven't found anywhere else, and ...
0 votes
1 answer
328 views
Use your own simulation to train a reinforcement learning multi-agent
I am wanting to train an RL multi-agent model to run in a propietary simulation, which is written in C++. Is there a way to change the simulation itself to create an agent, or must I use a ...
1 vote
0 answers
233 views
How to correctly train policies in multi-agent RL?
I am diving into Multi-Agent Reinforcement Learning and after reading some literature, I would like to clarify some approaches because I am not quite sure. Now for the following two cases it is clear ...
1 vote
2 answers
236 views
What is the relation between non-stationarity and "a moving target problem" in multi-agent reinforcement learning?
If single-agent RL algorithms such as Q-learning is applied to multi-agent systems (e.g. Markov games), the environment from the perspective of the agent is non-stationary, and the agent is faced with ...
1 vote
0 answers
197 views
Has anyone here tried to implement MADDPG for a different environment and succeeded?
Has anyone tried implementing the multi-agent RL algorithm MADDPG (I've linked the paper below)? The paper seems to have a good amount citations, and they do have their code on github. However, a few ...
1 vote
1 answer
311 views
How can rewards and loss calculation be extended to multiple agents in a vanilla policy gradient RL setting?
Say I have a simple multi-agent reinforcement learning problem using vanilla policy gradient methods (i.e. REINFORCE) that is currently running with one network per agent. If I can say that each of my ...
1 vote
0 answers
99 views
Book/course recommendation on game theory application to multi-agent system (reinforcement learning)
Is there any great game theory book or course that discusses the application of game theory to modern reinforcement learning or multi-agent systems? Or a classic reference book that can help me get a ...
0 votes
1 answer
261 views
Can a Reinforcement Learning problem with multiple simultaneous actions be formalized as a Multiagent Partially Observable Markov Decision Process?
Consider the following decision making problem. We have a controller that selects locations from a grid of coordinates and captures an image (observation $o_t$) with a camera at each location (action $...
1 vote
0 answers
344 views
How to parallelize multi-agent DDPG (MADDPG)
I am experimenting with MADDPG algorithm implemented in this repo. Since there were only a few agents (2-3) in the implementation (also in the original paper) steps like parameter updates, action ...
0 votes
1 answer
182 views
How to implement a rule-based decision maker for an agent-based model?
I had no idea that there is a stack exchange community for A.I. :-/ So I repost this question here in hope of some guidelines. I tried to delve into the materials discussed in AI: A Modern Approach ...
1 vote
0 answers
327 views
RLLib - What exactly do the avail_action and action_embed_size represent? How do they work with the action_mask to phase out invalid actions?
So, I'm fairly new to reinforcement learning and I needed some help/explanations as to what the action_mask and avail_action fields alongside the action_embed_size actually mean in RLlib (the ...
0 votes
0 answers
716 views
RLlib's Multi-agent PPO continuous actions turn into nan
After some amount of training on a custom Multi-agent sparse-reward environment using RLlib's (1.4.0) PPO network, I found that my continuous actions turn into nan (explodes?) which is probably caused ...