Skip to main content

Questions tagged [multi-agent-systems]

For questions related to multi-agent systems (MAS), which are systems that involve multiple agents (each of them can have different skills) that cooperate with each other and interact with the environment. There are several challenges faced by MAS, including coordination between agents, security, and task allocation. Multi-agent systems have been applied in areas such as computer science, civil engineering, and electrical engineering.

0 votes
0 answers
14 views

I’m reevaluating a deep-research workflow I built earlier and would love some advice. My previous design used a static tree workflow (fixed width/depth, node = search → extract → summarize → generate ...
Gosh Li's user avatar
0 votes
0 answers
27 views

Let $K$ be the one-step kernel of a single AlphaEvolve agent acting on the set $\mathcal S := \mathcal P_{\mathrm{good}}$ of programs that pass evaluation, and assume $K$ is irreducible. Run $k \ge 2$ ...
Ralph M's user avatar
  • 101
0 votes
1 answer
80 views

I’m working on a system where agents must move scattered objects (aggregates) from various locations (cells) to a target zone. The environment is divided into a grid of cells, where each cell contains ...
Nir Manor's user avatar
2 votes
1 answer
687 views

I have been looking for ways to train a Q-learning agent for a multiplayer zero-sum game (a variation of Tic-Tac-Toe in my case). I came up with a learning strategy I haven't found anywhere else, and ...
Vladislav Korecký's user avatar
0 votes
1 answer
328 views

I am wanting to train an RL multi-agent model to run in a propietary simulation, which is written in C++. Is there a way to change the simulation itself to create an agent, or must I use a ...
michael-c-michael's user avatar
1 vote
0 answers
233 views

I am diving into Multi-Agent Reinforcement Learning and after reading some literature, I would like to clarify some approaches because I am not quite sure. Now for the following two cases it is clear ...
thsolyt's user avatar
  • 11
1 vote
2 answers
236 views

If single-agent RL algorithms such as Q-learning is applied to multi-agent systems (e.g. Markov games), the environment from the perspective of the agent is non-stationary, and the agent is faced with ...
Necron's user avatar
  • 11
1 vote
0 answers
197 views

Has anyone tried implementing the multi-agent RL algorithm MADDPG (I've linked the paper below)? The paper seems to have a good amount citations, and they do have their code on github. However, a few ...
Confuse's user avatar
  • 111
1 vote
1 answer
311 views

Say I have a simple multi-agent reinforcement learning problem using vanilla policy gradient methods (i.e. REINFORCE) that is currently running with one network per agent. If I can say that each of my ...
Josh's user avatar
  • 99
1 vote
0 answers
99 views

Is there any great game theory book or course that discusses the application of game theory to modern reinforcement learning or multi-agent systems? Or a classic reference book that can help me get a ...
Maybe's user avatar
  • 471
0 votes
1 answer
261 views

Consider the following decision making problem. We have a controller that selects locations from a grid of coordinates and captures an image (observation $o_t$) with a camera at each location (action $...
Schlozma's user avatar
1 vote
0 answers
344 views

I am experimenting with MADDPG algorithm implemented in this repo. Since there were only a few agents (2-3) in the implementation (also in the original paper) steps like parameter updates, action ...
Mika's user avatar
  • 371
0 votes
1 answer
182 views

I had no idea that there is a stack exchange community for A.I. :-/ So I repost this question here in hope of some guidelines. I tried to delve into the materials discussed in AI: A Modern Approach ...
Vector Zita's user avatar
1 vote
0 answers
327 views

So, I'm fairly new to reinforcement learning and I needed some help/explanations as to what the action_mask and avail_action fields alongside the action_embed_size actually mean in RLlib (the ...
Akshay Ghosh's user avatar
0 votes
0 answers
716 views

After some amount of training on a custom Multi-agent sparse-reward environment using RLlib's (1.4.0) PPO network, I found that my continuous actions turn into nan (explodes?) which is probably caused ...
hridayns's user avatar
  • 243

15 30 50 per page