Highest Voted 'deep-rl' Questions - Artificial Intelligence Stack Exchange

24

votes

2 answers

Are there other approaches to deal with variable action spaces?

This question is about Reinforcement Learning and variable action spaces for every/some states. Variable action space Let's say you have an MDP, where the number of actions varies between states (for example like in Figure 1 or Figure 2). We can…

asked Dec 12 '18 at 13:27

Rikard Olsson

341
1
3
8

22

votes

3 answers

Why doesn't Q-learning converge when using function approximation?

The tabular Q-learning algorithm is guaranteed to find the optimal $Q$ function, $Q^*$, provided the following conditions (the Robbins-Monro conditions) regarding the learning rate are satisfied $\sum_{t} \alpha_t(s, a) = \infty$ $\sum_{t}…

reinforcement-learning q-learning deep-rl proofs function-approximation

asked Apr 05 '19 at 18:23

nbro

39,006
12
98
176

18

votes

1 answer

How does LSTM in deep reinforcement learning differ from experience replay?

In the paper Deep Recurrent Q-Learning for Partially Observable MDPs, the author processed the Atari game frames with an LSTM layer at the end. My questions are: How does this method differ from the experience replay, as they both use past…

reinforcement-learning long-short-term-memory deep-rl comparison experience-replay

asked Aug 27 '18 at 01:58

Kevin. Fang

353
1
2
7

17

votes

1 answer

Why does DQN require two different networks?

I was going through this implementation of DQN and I see that on line 124 and 125 two different Q networks have been initialized. From my understanding, I think one network predicts the appropriate action and the second network predicts the target Q…

reinforcement-learning deep-rl q-learning dqn target-network

asked Jul 02 '18 at 07:47

amitection

307
2
6

16

votes

2 answers

What is the difference between Q-learning, Deep Q-learning and Deep Q-network?

Q-learning uses a table to store all state-action pairs. Q-learning is a model-free RL algorithm, so how could there be the one called Deep Q-learning, as deep means using DNN; or maybe the state-action table (Q-table) is still there but the DNN is…

reinforcement-learning comparison q-learning dqn deep-rl

asked Jan 22 '21 at 09:41

Dee

1,283
1
11
35

14

votes

2 answers

How large should the replay buffer be?

I'm learning DDPG algorithm by following the following link: Open AI Spinning Up document on DDPG, where it is written In order for the algorithm to have stable behavior, the replay buffer should be large enough to contain a wide range of…

reinforcement-learning deep-rl hyper-parameters ddpg experience-replay

asked Apr 04 '19 at 14:40

ycenycute

341
1
2
6

11

votes

1 answer

What exactly is the advantage of double DQN over DQN?

I started looking into the double DQN (DDQN). Apparently, the difference between DDQN and DQN is that in DDQN we use the main value network for action selection and the target network for outputting the Q values. However, I don't understand why…

comparison q-learning dqn deep-rl double-dqn

asked Jul 30 '20 at 19:40

Chukwudi

349
2
7

10

votes

3 answers

How can you represent the state and action spaces for a card game in the case of a variable number of cards and actions?

I know how a machine can learn to play Atari games (Breakout): Playing Atari with Reinforcement Learning. With the same technique, it is even possible to play FPS games (Doom): Playing FPS Games with Reinforcement Learning. Further studies even…

reinforcement-learning game-ai deep-rl reference-request markov-decision-process

asked Oct 26 '16 at 08:11

Stefe Klauou

201
2
7

10

votes

2 answers

Was DeepMind's DQN learning simultaneously all the Atari games?

DeepMind states that its deep Q-network (DQN) was able to continually adapt its behavior while learning to play 49 Atari games. After learning all games with the same neural net, was the agent able to play them all at 'superhuman' levels…

reinforcement-learning deep-rl dqn deepmind atari-games

asked Oct 20 '16 at 01:42

Dion

203
2
6

9

votes

2 answers

What are the biggest barriers to get RL in production?

I am studying the state of the art of Reinforcement Learning, and my point is that we see so many applications in the real world using Supervised and Unsupervised learning algorithms in production, but I don't see the same thing with Reinforcement…

reinforcement-learning deep-rl applications

asked Jan 28 '21 at 02:11

Alexandre Krul

103
5

8

votes

1 answer

Is Experience Replay like dreaming?

Drawing parallels between Machine Learning techniques and a human brain is a dangerous operation. When it is done successfully, it can be a powerful tool for vulgarisation, but when it is done with no precaution, it can lead to major…

reinforcement-learning dqn deep-rl experience-replay

asked Sep 09 '18 at 19:07

16Aghnar

591
2
10

8

votes

2 answers

What is experience replay in laymen's terms?

I've been reading Google's DeepMind Atari paper and I'm trying to understand the concept of "experience replay". Experience replay comes up in a lot of other reinforcement learning papers (particularly, the AlphaGo paper), so I want to understand…

deep-learning reinforcement-learning deep-rl experience-replay

asked May 30 '18 at 19:09

user491626

241
1
4

8

votes

2 answers

Where to publish a first article in Deep Reinforcement Learning?

What would be examples of journals that are good for a first publication in the field of Deep Reinforcement Learning? I am in the process of writing about the research results of DQN-related algorithms. I have 3 requirements - it should be indexed…

reinforcement-learning deep-rl papers research

asked Nov 07 '17 at 09:02

Evalds Urtans

377
3
9

8

votes

2 answers

What are some online courses for deep reinforcement learning?

What are some (good) online courses for deep reinforcement learning? I would like the course to be both programming and theoretical. I really liked David Silver's course, but the course dates from 2015. It doesn't really teach deep Q-learning at…

reinforcement-learning q-learning dqn deep-rl resource-request

asked Mar 25 '20 at 14:46

J.Doe

91
3

8

votes

2 answers

Can DQN perform better than Double DQN?

I'm training both DQN and double DQN in the same environment, but DQN performs significantly better than double DQN. As I've seen in the double DQN paper, double DQN should perform better than DQN. Am I doing something wrong or is it possible?

reinforcement-learning q-learning dqn deep-rl double-dqn

asked Apr 08 '19 at 09:08

Angelo

201
2
16

Questions tagged [deep-rl]