For questions about multi-agent reinforcement learning (MARL) agents, algorithms, or models.
Questions tagged [multi-agent-rl]
11 questions
4
votes
2 answers
How do I get started with multi-agent reinforcement learning?
Is there any tutorial that walks through a multi-agent reinforcement learning implementation (in Python) using libraries such as OpenAI's Gym (for the environment), TF-Agents, and Stable-Baselines3?
I searched a lot, but I was not able to find any…

Rnj
3
votes
2 answers
How to model a multi-agent reinforcement learning problem where actions of different agents can take different durations?
I am confused, at a conceptual level, about how to model a multi-agent reinforcement learning problem in which each agent's action takes a different amount of time to complete. This means that a certain action is performed…

hridayns
2
votes
0 answers
When to model a decision-making problem as a single-agent vs. a multi-agent problem?
I understand the goals and purposes of RL in the case of a single agent and the underlying model, i.e. MDPs, for RL problems (or sequential decision making with uncertainty in general).
My question is (and I know this will/may be subjective) are the…

David
2
votes
0 answers
Which multi-agent reinforcement learning algorithm can I use when there are two types of agents with different action spaces?
Most of the papers on multi-agent RL (MARL) that I have encountered have multiple agents who have a common action space.
In my work, the scenario involves $m$ agents of one type (say, type A) and $n$ agents of another type. Here,…

user3656142
2
votes
1 answer
Why can I still easily beat my Q-learning agent that was trained against another Q-learning agent to play tic-tac-toe?
I implemented the Q-learning algorithm to play tic-tac-toe. The AI plays against the same algorithm, but they don't share the same Q matrix. After 200,000 games, I still beat the AI very easily and it's rather dumb. My selection is made by epsilon…

Irindul
2
votes
1 answer
Is there a multi-agent deep reinforcement learning algorithm for environments with only discrete action spaces (not hybrid)?
Is there a multi-agent deep reinforcement learning algorithm that is designed for environments with only discrete action spaces (not hybrid) and uses centralized training?
I have been looking at algorithms (A2C, MADDPG, etc.), but I still haven't found any…

Uur Kn
1
vote
1 answer
How can rewards and loss calculation be extended to multiple agents in a vanilla policy gradient RL setting?
Say I have a simple multi-agent reinforcement learning problem using vanilla policy gradient methods (i.e. REINFORCE) that is currently running with one network per agent. If I can say that each of my agents:
are all of the same class
have…

Josh
1
vote
0 answers
When should I use an MARL approach instead of training one agent while keeping the others fixed?
I have built a custom multi-agent environment with PettingZoo, in which a turn-based game with two agents, A and B, is set up.
I want to examine situations where malicious behavior may arise, given the game rules, and I am looking into training…

npit
0
votes
0 answers
Help defining environment with complex action space
I'm working on a personal MARL project with a high-dimensional and continuous action space. The environment is designed to give positive rewards to actions between some moving limits of the action range, and negative rewards to the actions outside…
0
votes
0 answers
Are there assumptions made about Self-Play that don't hold up in regular MA competition?
I read the paper "Efficient Competitive Self-Play Policy Optimization", which proposes an algorithm for training a population of agents with self-play using a perturbation-based matchmaking approach.
I was wondering if this algorithm can also be…

kitaird
-1
votes
1 answer
RL framework to optimize my custom multi-agent simulator
I have built a custom discrete event simulator with multiple agents and want to optimize the system using RL frameworks that support multi-agent configurations. I will use custom policies. Which framework should I use? I've looked into SB3, CleanRL,…

RookieScientist