For questions related to the tabular version of the double Q-learning algorithm, introduced in "Double Q-learning" (NIPS 2010) by Hado van Hasselt.
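For reference, the update rule the tag refers to can be sketched as follows. This is a minimal illustration, not the paper's experimental setup: the `dict`-of-`dict`s table layout, the example transition, and the hyperparameter values are all arbitrary choices.

```python
import random

def double_q_update(QA, QB, s, a, r, s_next, alpha=0.1, gamma=0.99):
    """One tabular double Q-learning update step.

    A coin flip decides which table is updated; the greedy next action
    is selected with one table and evaluated with the other, which is
    what decouples action selection from action evaluation.
    """
    if random.random() < 0.5:
        a_star = max(QA[s_next], key=QA[s_next].get)  # select with QA
        QA[s][a] += alpha * (r + gamma * QB[s_next][a_star] - QA[s][a])
    else:
        b_star = max(QB[s_next], key=QB[s_next].get)  # select with QB
        QB[s][a] += alpha * (r + gamma * QA[s_next][b_star] - QB[s][a])

# Tiny illustrative transition: state 0, action 0, reward 1, next state 1.
random.seed(0)
QA = {0: {0: 0.0, 1: 0.0}, 1: {0: 1.0, 1: 0.0}}
QB = {0: {0: 0.0, 1: 0.0}, 1: {0: 0.0, 1: 2.0}}
double_q_update(QA, QB, 0, 0, 1.0, 1)
```

Note that only one of the two tables changes per step, which is why the sample transition above leaves the other table untouched.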
Questions tagged [double-q-learning]
6 questions
7
votes
1 answer
Deep Q-Learning "catastrophic drop" reasons?
I am implementing some "classical" papers in model-free RL, such as DQN, Double DQN, and Double DQN with Prioritized Replay.
Across the various models I'm running on CartPole-v1 with the same underlying NN, I am noticing that all three of the above exhibit a…

Virus
- 71
- 1
- 5
5
votes
1 answer
Why does regular Q-learning (and DQN) overestimate the Q values?
The motivation for introducing double DQN (and double Q-learning) is that regular Q-learning (or DQN) can overestimate Q values, but is there a brief explanation of why this overestimation occurs?

ground clown
- 111
- 2
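The core of the usual answer can be demonstrated in a few lines (a sketch added here for illustration, not taken from the question): even when every action's true value is 0 and each estimate is unbiased zero-mean noise, taking the max over the estimates is biased upward, since E[max_a Q̂(s, a)] ≥ max_a E[Q̂(s, a)].

```python
import random

# True Q values are all zero, so the true max is 0. Each estimate is
# unbiased standard-normal noise, yet the max over estimates is
# systematically positive (around 1.5 for 10 standard normals).
random.seed(42)
n_actions, n_trials = 10, 10_000
avg_max = sum(
    max(random.gauss(0.0, 1.0) for _ in range(n_actions))
    for _ in range(n_trials)
) / n_trials
```

Here `avg_max` comes out well above 0, which is exactly the overestimation that double Q-learning is designed to reduce.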
2
votes
1 answer
How to embed game grid state with walls as an input to neural network
I've read most of the posts here on this subject; however, most of them deal with game boards that have only two categories of single pieces and no walls.
My game board has walls, and multiple instances of food.…

Arlo Rostirolla
- 31
- 1
1
vote
1 answer
Q-learning achieves small reward in simple dice game
I am trying to train a Q learning agent on the following game: The states are parametrised by an integer $S \geq 0$ (representing the sum of the previous die rolls). In each step the player can choose to roll a die or quit the game. Whenever the…

deepfloe
- 111
- 2
0
votes
1 answer
Does "number of actions" refer to the number of actions taken or size of the action space?
In the original DDQN article (https://arxiv.org/pdf/1509.06461.pdf), the phrase "number of actions" is used twice:
first, in the following context;
secondly, in Theorem 1.
I have a hard time understanding how the phrase is being used, or if it…

GeorgeWTrump
- 37
- 5
0
votes
0 answers
Is there any toy example that can exemplify the performance of double Q-learning?
I recently tried to reproduce the results of double Q-learning. However, the results are not satisfying. I have also tried to compare double Q-learning with Q-learning on Taxi-v3, non-slippery FrozenLake, Roulette-v0, etc. But Q-learning…

Allen_FrCh
- 1
- 1
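One classic toy setting where double Q-learning's advantage shows up is the maximisation-bias example (the kind discussed in van Hasselt's paper and in Sutton & Barto). Stripped of the MDP, it reduces to comparing the single and double estimators of max_a E[R(a)]; the action counts and sample sizes below are arbitrary illustrative choices.

```python
import random

# Every action's true mean reward is 0, so the true max is 0.
# Single estimator: max over one set of sample means (biased upward).
# Double estimator: pick the argmax on set A, evaluate it on set B
# (unbiased for the value of the chosen action).
random.seed(0)
n_actions, n_samples, n_trials = 8, 10, 2_000
single_est = double_est = 0.0
for _ in range(n_trials):
    means_a = [sum(random.gauss(0, 1) for _ in range(n_samples)) / n_samples
               for _ in range(n_actions)]
    means_b = [sum(random.gauss(0, 1) for _ in range(n_samples)) / n_samples
               for _ in range(n_actions)]
    single_est += max(means_a)
    best = max(range(n_actions), key=lambda a: means_a[a])
    double_est += means_b[best]
single_est /= n_trials
double_est /= n_trials
```

With these settings `single_est` lands clearly above 0 while `double_est` stays near the true value of 0, which is the behaviour a toy MDP comparison of Q-learning versus double Q-learning should reproduce.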