Highest Voted 'discrete-action-spaces' Questions - Artificial Intelligence Stack Exchange

16

votes

3 answers

How to implement a variable action space in Proximal Policy Optimization?

I'm coding a Proximal Policy Optimization (PPO) agent with the Tensorforce library (which is built on top of TensorFlow). The first environment was very simple. Now, I'm diving into a more complex environment, where not all the actions are available…

asked Aug 29 '18 at 16:04

Max

6

votes

1 answer

What techniques are used to make MDP discrete state space manageable?

Generating a discretized state space for an MDP (Markov Decision Process) model seems to suffer from the curse of dimensionality. Suppose my state has a few simple features: Feeling: Happy/Neutral/Sad; Feeling: Hungry/Neither/Full; Food left:…

reinforcement-learning markov-decision-process state-spaces continuous-action-spaces discrete-action-spaces

asked Dec 22 '16 at 01:35

Brendan Hill

4

votes

1 answer

Can a large discrete action space be represented using Gaussian distributions?

I have a large 1D action space, e.g. dim(A)=2000-10000. Can I use a continuous action space where I could learn the mean and std of the Gaussian distribution that I would sample actions from, and round the sampled value to the nearest integer? If yes,…
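The sample-and-round idea described in this question can be sketched as follows. This is only an illustration under stated assumptions: the mean and std would normally come from a learned policy network, but here they are fixed hypothetical values, and the helper name `sample_discrete_action` and the clipping to the valid index range are assumptions not mentioned in the question.

```python
import numpy as np


def sample_discrete_action(mean, std, n_actions, rng):
    """Draw from N(mean, std^2), round to the nearest integer, and clip.

    Hypothetical helper: clipping to [0, n_actions - 1] is an assumption
    added so that the result is always a valid action index.
    """
    raw = rng.normal(loc=mean, scale=std)       # continuous sample
    rounded = np.rint(raw)                      # nearest integer (as float)
    return int(np.clip(rounded, 0, n_actions - 1))


rng = np.random.default_rng(0)
n_actions = 5000                                # within the 2000-10000 range mentioned

# Fixed stand-ins for what a policy network would output.
mean, std = 2500.0, 50.0
actions = [sample_discrete_action(mean, std, n_actions, rng) for _ in range(1000)]
```

Note that this parameterization treats neighbouring action indices as similar, so it is likely to make sense only when the actions have a meaningful ordering.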

reinforcement-learning deep-rl continuous-action-spaces discrete-action-spaces

asked Mar 09 '21 at 05:34

Mika

2

votes

1 answer

Extend the loss function from the single action to the n-action case per time step

My question concerns a side question (which was not answered) asked here: How can policy gradients be applied in the case of multiple continuous actions? I am trying to implement a simple policy gradient algorithm for a discrete multi-action…

reinforcement-learning objective-functions discrete-action-spaces multi-action-rl

asked Apr 26 '18 at 13:42

BruceWayne

2

votes

1 answer

Model-based RL algorithms for continuous state space and finite action space

Suppose that, at the beginning, I have a complete model $p(s' \mid s, a)$ (an assumed true model that describes the environment well enough) and the reward function $r(s,a,s')$. How can I exploit the model and learn a good policy in this situation? Assume that…

reinforcement-learning reference-request model-based-methods continuous-state-spaces discrete-action-spaces

asked Apr 11 '23 at 23:07

k2pctdn

2

votes

1 answer

Is there a multi-agent deep reinforcement learning algorithm for environments with only discrete action spaces (not hybrid)?

Is there a multi-agent deep reinforcement learning algorithm that is designed for environments with only discrete action spaces (not hybrid) and uses centralized training? I have been looking at algorithms (A2C, MADDPG, etc.) but still haven't found any…

reinforcement-learning deep-rl discrete-action-spaces multi-agent-rl

asked May 17 '21 at 09:23

Uur Kn

1

vote

1 answer

PPO: multiple discrete actions per step, one depends on the other

I have a custom PPO implementation, and it works fine, but I need to add to it the ability to select 2 actions per turn, one different in nature from the other, and one dependent on the other. Imagine that a turn had 20 possible actions. Now, for each…

reinforcement-learning proximal-policy-optimization discrete-action-spaces

asked Jun 09 '22 at 16:52

Antonis Karvelas

1

vote

0 answers

What adapts an algorithm to continuous or to discrete action spaces?

Some RL algorithms can only be used for environments with continuous action spaces (e.g. TD3, SAC), while others only for discrete action spaces (DQN), and some for both. REINFORCE and other policy gradient variants have the choice of using a…

reinforcement-learning dqn reinforce continuous-action-spaces discrete-action-spaces

asked Nov 15 '20 at 07:11

mugoh

0

votes

0 answers

Training Issue in Solving Multi-Dimensional Multiple Knapsack Problem with Transformer Model and PPO and SAC algorithm

I'm reaching out to the brilliant minds of the AI community to seek help with a challenging issue in my project on solving the multi-dimensional multiple knapsack problem using a transformer model. As part of my master's thesis focusing on resource…

transformer proximal-policy-optimization soft-actor-critic discrete-action-spaces

asked Aug 09 '23 at 11:56

Mohammad Hosseini

0

votes

0 answers

How to implement RL model with increasing dimensions of state space and action space?

I've read in this discussion that "reinforcement learning is a way of finding the value function of a Markov Decision Process". I want to implement an RL model whose state space and action space dimensions would increase as the MDP progresses. But…

reinforcement-learning action-spaces discrete-action-spaces

asked Dec 12 '20 at 15:11

brzepkowski

Questions tagged [discrete-action-spaces]