For questions about discrete action spaces in the context of reinforcement learning (or other artificial intelligence sub-fields). There is also the tag for continuous action spaces.
Questions tagged [discrete-action-spaces]
10 questions
16
votes
3 answers
How to implement a variable action space in Proximal Policy Optimization?
I'm coding a Proximal Policy Optimization (PPO) agent with the Tensorforce library (which is built on top of TensorFlow).
The first environment was very simple. Now, I'm diving into a more complex environment, where all the actions are not available…

Max
- 163
- 1
- 6
6
votes
1 answer
What techniques are used to make MDP discrete state space manageable?
Generating a discretized state space for an MDP (Markov Decision Process) model seems to suffer from the curse of dimensionality.
Supposed my state has a few simple features:
Feeling: Happy/Neutral/Sad
Feeling: Hungry/Neither/Full
Food left:…

Brendan Hill
- 263
- 1
- 6
4
votes
1 answer
Can a large discrete action space be represented using Gaussian distributions?
I have a large 1D action space, e.g. dim(A)=2000-10000. Can I use continuous action space where I could learn the mean and std of the Gaussian distributions that I would use to sample action from and round the value to the nearest integer? If yes,…

Mika
- 331
- 1
- 8
2
votes
1 answer
Extend the loss function from the single action to the n-action case per time step
My question concerns a side question (which was not answered) asked here:
How can policy gradients be applied in the case of multiple continuous actions?
I am trying to implement a simple policy gradient algorithm for a discrete multi-action…

BruceWayne
- 31
- 2
2
votes
1 answer
Model-based RL algorithms for continuous state space and finite action space
At the beginning, if I have a complete model $p(s' \mid s, a)$ (an assumed true model that describes the environment well enough) and the reward function $r(s,a,s')$. How can I exploit the model and learn a good policy in this situation? Assume that…

k2pctdn
- 35
- 5
2
votes
1 answer
Is there a multi-agent deep reinforcement learning algorithm which is for environments with only discrete action spaces (Not hybrid)?
Is there a multi-agent deep reinforcement learning algorithm which is for environments with only discrete action spaces (Not hybrid) and have centralized training?
I have been looking for algorithms, (A2C, MADDPG etc.) but still havent find any…

Uur Kn
- 21
- 1
1
vote
1 answer
PPO: multiple discrete actions per step, one depends on the other
I have a custom PPO implementation, and it works fine, but I need to add to it the ability to select 2 actions per turn, one different in nature from the other, one dependent on the other.
Imagine that a turn a had 20 possible actions. Now, for each…

Antonis Karvelas
- 65
- 5
1
vote
0 answers
What adapts an algorithm to continuous or to discrete action spaces?
Some RL algorithms can only be used for environments with continuous action spaces (e.g TD3, SAC), while others only for discrete action spaces (DQN), and some for both
REINFORCE and other policy gradient variants have the choice of using a…

mugoh
- 531
- 4
- 20
0
votes
0 answers
Training Issue in Solving Multi-Dimensional Multiple Knapsack Problem with Transformer Model and PPO and SAC algorithm
I'm reaching out to the brilliant minds of the AI community to seek help with a challenging issue in my project on solving the multi-dimensional multiple knapsack problem using a transformer model. As part of my master's thesis focusing on resource…
0
votes
0 answers
How to implement RL model with increasing dimensions of state space and action space?
I've read in this discussion that "reinforcement learning is a way of finding the value function of a Markov Decision Process".
I want to implement an RL model, whose state space and action space dimensions would increase, as the MDP progresses. But…

brzepkowski
- 141
- 4