Questions tagged [multi-agent-rl]

For questions about multi-agent reinforcement learning (MARL) agents, algorithms, or models.

11 questions
4
votes
2 answers

How do I get started with multi-agent reinforcement learning?

Is there any tutorial that walks through a multi-agent reinforcement learning implementation (in Python) using libraries such as OpenAI's Gym (for the environment), TF-agents, and stable-baselines-3? I searched a lot, but I was not able to find any…
3
votes
2 answers

How to model a multi-agent reinforcement learning problem where actions of different agents can take different durations?

I am confused on a conceptual scale how I would be able to model a multi-agent reinforcement learning problem when each agent performing an action would take different durations to complete the action. This means that a certain action is performed…
2
votes
0 answers

When to model decision-making problem as single agent vs multi-agent problem?

I understand the goals and purposes of RL in the case of a single agent and the underlying model, i.e. MDPs, for RL problems (or sequential decision making with uncertainty in general). My question is (and I know this will/may be subjective) are the…
2
votes
0 answers

Which multi-agent reinforcement learning algorithm can I use when there are two types of agents with different action spaces?

Most of the papers on multi-agent RL (MARL) that I have encountered have multiple agents who have a common action space. In my work, my scenario involves $m$ numbers of a particular agent (say type A) and $n$ numbers of another type of agent. Here,…
2
votes
1 answer

Why can I still easily beat my Q-learning agent that was trained against another Q-learning agent to play tic tac toe?

I implemented the Q-learning algorithm to play tic-tac-toe. The AI plays against the same algorithm, but they don't share the same Q matrix. After 200,000 games, I still beat the AI very easily and it's rather dumb. My selection is made by epsilon…
2
votes
1 answer

Is there a multi-agent deep reinforcement learning algorithm which is for environments with only discrete action spaces (Not hybrid)?

Is there a multi-agent deep reinforcement learning algorithm which is for environments with only discrete action spaces (Not hybrid) and have centralized training? I have been looking for algorithms, (A2C, MADDPG etc.) but still havent find any…
1
vote
1 answer

How can rewards and loss calculation be extended to multiple agents in a vanilla policy gradient RL setting?

Say I have a simple multi-agent reinforcement learning problem using vanilla policy gradient methods (i.e. REINFORCE) that is currently running with one network per agent. If I can say that each of my agents: are all of the same class have…
1
vote
0 answers

When should I use an MARL approach instead of training one agent while keep the others fixed?

I have built a custom multi-agent environment with PettingZoo, where a turn-based game with two agents, A and B, is setup. I want to examine situations where malicious behavior may arise, given the game rules, and I am looking into training…
npit
  • 111
  • 1
0
votes
0 answers

Help defining environment with complex action space

I'm working on a personal MARL project with a high-dimensional and continuous action space. The environment is designed to give positive rewards to actions between some moving limits of the action range, and negative rewards to the actions outside…
0
votes
0 answers

Are there assumptions made about Self-Play that don't hold up in regular MA competition?

I read about this paper Efficient Competitive Self-Play Policy Optimization which proposes an algorithm for training a population of agents with self-play using a perturbation based matchmaking approach. I was wondering if this algorithm can also be…
-1
votes
1 answer

RL framework to optimize my custom multi-agent simulator

I have built a custom discrete event simulator with multiple agents and want to optimize the system using RL frameworks that support multi-agent configurations. I will use custom policies. Which framework should I use? I've looked into SB3, CleanRL,…