Questions tagged [multi-agent-systems]

Question 1

I’m reevaluating a deep-research workflow I built earlier and would love some advice. My previous design used a static tree workflow (fixed width/depth, node = search → extract → summarize → generate ...

Question 2

Let $K$ be the one-step kernel of a single AlphaEvolve agent acting on the set $\mathcal S := \mathcal P_{\mathrm{good}}$ of programs that pass evaluation, and assume $K$ is irreducible. Run $k \ge 2$ ...

Question 3

I’m working on a system where agents must move scattered objects (aggregates) from various locations (cells) to a target zone. The environment is divided into a grid of cells, where each cell contains ...

Question 4

I have been looking for ways to train a Q-learning agent for a multiplayer zero-sum game (a variation of Tic-Tac-Toe in my case). I came up with a learning strategy I haven't found anywhere else, and ...

Question 5

I am wanting to train an RL multi-agent model to run in a propietary simulation, which is written in C++. Is there a way to change the simulation itself to create an agent, or must I use a ...

Question 6

I am diving into Multi-Agent Reinforcement Learning and after reading some literature, I would like to clarify some approaches because I am not quite sure. Now for the following two cases it is clear ...

Question 7

If single-agent RL algorithms such as Q-learning is applied to multi-agent systems (e.g. Markov games), the environment from the perspective of the agent is non-stationary, and the agent is faced with ...

Question 8

Has anyone tried implementing the multi-agent RL algorithm MADDPG (I've linked the paper below)? The paper seems to have a good amount citations, and they do have their code on github. However, a few ...

Question 9

Say I have a simple multi-agent reinforcement learning problem using vanilla policy gradient methods (i.e. REINFORCE) that is currently running with one network per agent. If I can say that each of my ...

Question 10

Is there any great game theory book or course that discusses the application of game theory to modern reinforcement learning or multi-agent systems? Or a classic reference book that can help me get a ...

Question 11

Consider the following decision making problem. We have a controller that selects locations from a grid of coordinates and captures an image (observation $o_t$) with a camera at each location (action $...

Question 12

I am experimenting with MADDPG algorithm implemented in this repo. Since there were only a few agents (2-3) in the implementation (also in the original paper) steps like parameter updates, action ...

Question 13

I had no idea that there is a stack exchange community for A.I. :-/ So I repost this question here in hope of some guidelines. I tried to delve into the materials discussed in AI: A Modern Approach ...

Question 14

So, I'm fairly new to reinforcement learning and I needed some help/explanations as to what the action_mask and avail_action fields alongside the action_embed_size actually mean in RLlib (the ...

Question 15

After some amount of training on a custom Multi-agent sparse-reward environment using RLlib's (1.4.0) PPO network, I found that my continuous actions turn into nan (explodes?) which is probably caused ...

Stack Exchange Network

Questions tagged [multi-agent-systems]

Rethinking My Deep-Research Agent Workflow — Should We Move Beyond Static Trees?

Irreducibility of an island model with periodic migration

Creating a Dynamic Heat Map for Multi-Agent Path Planning

Q learning (DQN) strategy for a multiplayer zero-sum game

Use your own simulation to train a reinforcement learning multi-agent

How to correctly train policies in multi-agent RL?

What is the relation between non-stationarity and "a moving target problem" in multi-agent reinforcement learning?

Has anyone here tried to implement MADDPG for a different environment and succeeded?

How can rewards and loss calculation be extended to multiple agents in a vanilla policy gradient RL setting?

Book/course recommendation on game theory application to multi-agent system (reinforcement learning)

Can a Reinforcement Learning problem with multiple simultaneous actions be formalized as a Multiagent Partially Observable Markov Decision Process?

How to parallelize multi-agent DDPG (MADDPG)

How to implement a rule-based decision maker for an agent-based model?

RLLib - What exactly do the avail_action and action_embed_size represent? How do they work with the action_mask to phase out invalid actions?

RLlib's Multi-agent PPO continuous actions turn into nan

Hot Network Questions