Questions tagged [observation-spaces]

For questions about observation spaces in the context of reinforcement learning and other AI sub-fields.

10 questions
4
votes
1 answer

Are multi-agent or self-play environments always automatically POMDPs?

As part of my thesis, I'm working on a zero-sum game with RL to train an agent. The game is a real-time variant of Pong; one could imagine playing Pong with both sides being foosball rods. As I see it, this is an MDP with perfect…
3
votes
2 answers

Should I apply normalization to the observations in deep reinforcement learning?

I am new to DRL and trying to implement my custom environment. I want to know whether normalization and regularization techniques are as important in RL as in deep learning. In my custom environment, the state/observation values are in different ranges…
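For readers landing here: the standard trick is to normalize observations with running statistics collected during training. A minimal sketch using Welford's algorithm (the class and the sample values are illustrative, not from any answer):

```python
import numpy as np

class RunningNormalizer:
    """Normalizes observations with running mean/variance (Welford's algorithm)."""

    def __init__(self, shape, eps=1e-8):
        self.mean = np.zeros(shape)
        self.m2 = np.zeros(shape)   # running sum of squared deviations
        self.count = 0
        self.eps = eps

    def normalize(self, obs):
        obs = np.asarray(obs, dtype=np.float64)
        self.count += 1
        delta = obs - self.mean
        self.mean += delta / self.count
        self.m2 += delta * (obs - self.mean)  # uses the updated mean
        var = self.m2 / self.count
        return (obs - self.mean) / np.sqrt(var + self.eps)

# hypothetical observations whose features live on very different scales
norm = RunningNormalizer(shape=(3,))
for obs in ([250.0, 0.01, -3000.0], [260.0, 0.02, -2900.0], [240.0, 0.03, -3100.0]):
    print(norm.normalize(obs))  # drifts toward zero mean, unit variance
```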
3
votes
2 answers

What happens when the agent faces a state that it has never encountered before?

I have a network with nodes and links, each of them with a certain amount of resources (which can take discrete values) in the initial state. At random time steps, a service is generated and, based on the agent's action, the network status changes,…
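For context on why function approximation matters here: unlike a lookup table, a network keeps producing Q-values for inputs it has never seen, and nearby inputs get similar outputs. A toy numpy sketch (random weights and a hypothetical 8-resource state; nothing here comes from the question):

```python
import numpy as np

rng = np.random.default_rng(0)

# a toy Q-network: vector of resource levels -> one Q-value per action
# (weights are random here; in practice they come from training, e.g. DQN)
W1, b1 = rng.normal(size=(8, 16)), np.zeros(16)
W2, b2 = rng.normal(size=(16, 4)), np.zeros(4)

def q_values(state):
    h = np.maximum(state @ W1 + b1, 0.0)  # ReLU hidden layer
    return h @ W2 + b2                    # Q-value per action

seen   = np.array([3, 1, 4, 0, 2, 2, 1, 5], dtype=float)
unseen = np.array([3, 1, 4, 0, 2, 2, 1, 6], dtype=float)  # never visited

# the network still outputs Q-values for the unseen state; similar inputs
# yield similar outputs, which is the generalization being asked about
print(q_values(seen))
print(q_values(unseen))
```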
2
votes
0 answers

What to look out for when designing an environment regarding observations?

When designing an environment, what should one look out for when designing the observation space so that the environment is as easy as possible for an agent to learn? E.g., make sure the Markov property is fulfilled if possible, but I mean also…
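One concrete instance of the Markov point raised above: for a moving object, position alone is not Markov, since the next position also depends on an unobserved velocity. A minimal sketch assuming a Gymnasium-style Box space; the bounds are purely illustrative:

```python
import numpy as np
from gymnasium import spaces  # assumes gymnasium is installed

# Position alone is not Markov for a moving object: the next position
# depends on the current velocity, which the agent cannot observe here.
non_markov_space = spaces.Box(low=-1.0, high=1.0, shape=(2,), dtype=np.float32)

# Including velocity restores the Markov property; bounding and normalizing
# the ranges also keeps all features on a comparable scale for the network.
markov_space = spaces.Box(
    low=np.array([-1.0, -1.0, -0.1, -0.1], dtype=np.float32),  # x, y, vx, vy
    high=np.array([1.0, 1.0, 0.1, 0.1], dtype=np.float32),
    dtype=np.float32,
)
```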
2
votes
0 answers

How do neural networks deal with inputs of different sizes that are padded to a common size?

I am trying to create an environment for RL where the size of my input (observation space) is not fixed. As a workaround, I thought about padding the input to a maximum size and then assigning "null" to the values that do not exist. Now, these…
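A common way to make such padding workable (an illustrative sketch, not something stated in the question): pair the padded values with an explicit binary mask, since a sentinel like 0.0 is ambiguous on its own. MAX_ITEMS and PAD_VALUE are hypothetical:

```python
import numpy as np

MAX_ITEMS = 10   # hypothetical maximum observation size
PAD_VALUE = 0.0  # "null" fill; alone it is ambiguous with a real 0.0

def pad_observation(items):
    """Pads a variable-length observation and returns an explicit mask.

    Appending a binary mask lets the network distinguish a real 0.0
    from padding, instead of relying on a sentinel value alone.
    """
    items = np.asarray(items, dtype=np.float32)
    padded = np.full(MAX_ITEMS, PAD_VALUE, dtype=np.float32)
    mask = np.zeros(MAX_ITEMS, dtype=np.float32)
    padded[: len(items)] = items
    mask[: len(items)] = 1.0
    return np.concatenate([padded, mask])  # fixed-size input: values + mask

print(pad_observation([0.3, 0.0, 0.9]))  # length is always 2 * MAX_ITEMS
```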
1
vote
1 answer

Variable observation space at each episode

I have an environment with continuous actions and state variables. Every time I reset my env, between 2 and 5 balls spawn randomly in a 100x100 box. One of those balls (the red one) will receive an action (direction of movement) and will move…
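One common workaround for this kind of variable-count observation (a sketch under assumed conventions, not necessarily what any answer proposes): reserve a fixed number of slots, with a presence flag per slot, so the state shape stays constant across resets:

```python
import numpy as np

MAX_BALLS = 5  # the env spawns between 2 and 5 balls

def encode_balls(ball_positions):
    """Fixed-size observation: (x, y, present) per slot, red ball in slot 0.

    Unused slots stay all-zero with present=0, so the observation shape
    is constant even though the number of balls varies per episode.
    """
    obs = np.zeros((MAX_BALLS, 3), dtype=np.float32)
    for i, (x, y) in enumerate(ball_positions[:MAX_BALLS]):
        obs[i] = (x / 100.0, y / 100.0, 1.0)  # normalize to the 100x100 box
    return obs.flatten()

# hypothetical episode with 3 balls; the red ball is placed first
print(encode_balls([(50.0, 50.0), (10.0, 80.0), (90.0, 20.0)]).shape)  # (15,)
```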
1
vote
1 answer

Scrabble rack observation with MuZero

Currently I'm trying to implement Scrabble with MuZero. The $15 \times 15$ game board observation (as input) is of size $27 \times 15 \times 15$ (26 letters + 1 wildcard) with values of 0 or 1. However, I'm having difficulty finding a suitable way…
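One plausible rack encoding that matches the plane format (an assumption-laden sketch, not an answer from the thread): since the rack is an unordered multiset of at most 7 tiles, per-letter counts broadcast to constant planes are order-invariant and stack directly onto the board channels:

```python
import numpy as np

LETTERS = "abcdefghijklmnopqrstuvwxyz?"  # 26 letters + wildcard

def rack_planes(rack, board_size=15):
    """Encodes a rack as 27 constant planes of scaled tile counts.

    Per-letter counts (divided by the 7-tile rack limit to stay in
    [0, 1], like the board planes) are invariant to tile order and can
    be stacked onto the 27x15x15 board planes as extra input channels.
    """
    counts = np.zeros(len(LETTERS), dtype=np.float32)
    for tile in rack:
        counts[LETTERS.index(tile)] += 1.0
    counts /= 7.0
    return np.broadcast_to(
        counts[:, None, None], (len(LETTERS), board_size, board_size)
    ).copy()

print(rack_planes(list("aaezq?r")).shape)  # (27, 15, 15)
```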
1
vote
0 answers

Does the order in which the features are concatenated to create the state (or observation) matter?

I'm experimenting with an RL agent that interacts with the following environment. The learning algorithm is double DQN. The neural network represents the function from state to action. It's built with a Keras Sequential model and has two dense layers…
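The usual answer in sketch form (illustrative, with hypothetical feature names): which order you pick is irrelevant to a dense network, since it just learns weights for each slot, but the order must stay fixed across steps and episodes so each input slot keeps its meaning:

```python
import numpy as np

FEATURE_ORDER = ("position", "velocity", "fuel")  # fixed once, never changed

def build_state(features):
    """Concatenates features in one canonical order.

    A dense network is insensitive to WHICH order you choose, but the
    order must be identical at every step and every episode; otherwise
    input slot i changes meaning between samples.
    """
    return np.concatenate([np.atleast_1d(features[k]) for k in FEATURE_ORDER])

state = build_state({"velocity": [0.1, -0.2], "position": [3.0, 4.0], "fuel": 0.7})
print(state)  # always [pos_x, pos_y, vel_x, vel_y, fuel]
```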
0
votes
1 answer

What are the differences between loss surfaces that "derive" from different observations?

If I understand correctly, each observation within a dataset creates a different loss surface on which we want to find the global minimum. How different are those surfaces from one another? Would it be correct to say that they differ like (for example)…
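For reference, the standard relationship (a general fact, not something asserted in the question): the full loss is the average of the per-observation surfaces over the same parameter space,

$$\mathcal{L}(\theta) = \frac{1}{N}\sum_{i=1}^{N} \ell\big(f_\theta(x_i),\, y_i\big),$$

so each $\ell_i(\theta) = \ell(f_\theta(x_i), y_i)$ shares the domain $\theta$ but generally has different minima; minibatch SGD descends on the average of a few of these surfaces at a time.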
0
votes
1 answer

When should discretization of observations be considered?

I found some literature regarding the design of action spaces, e.g. that discretization of continuous actions in video-game environments can be crucial for successful learning (Action Space Shaping in Deep Reinforcement Learning,…
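For comparison, a minimal sketch of observation binning (illustrative bounds and bin count; the cited paper is about action spaces, not this code): mapping a continuous observation to a tuple of bin indices turns it into a discrete state usable as a table key:

```python
import numpy as np

# illustrative bounds and resolution; real values depend on the environment
LOW, HIGH, N_BINS = np.array([-1.0, -5.0]), np.array([1.0, 5.0]), 10

def discretize(obs):
    """Maps a continuous observation to integer bin indices.

    This yields N_BINS**dim discrete states, e.g. for tabular methods;
    too-coarse bins alias distinct states, too-fine bins blow up the
    table, which is the usual trade-off behind this design question.
    """
    scaled = (np.asarray(obs) - LOW) / (HIGH - LOW)  # rescale into [0, 1]
    bins = (scaled * N_BINS).astype(int)
    return tuple(int(b) for b in np.clip(bins, 0, N_BINS - 1))

print(discretize([0.13, -2.4]))  # (5, 2)
```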