Questions tagged [d3qn]
3 questions
1
vote
2 answers
Questions on the identifiability issue and equations 8 and 9 in the D3QN paper
I have difficulty understanding the following paragraph in the below excerpts from page 4 to page 5 from the paper Dueling Network Architectures for Deep Reinforcement Learning.
The author said "we can force the advantage function estimator to have…

Cheng
- 85
- 4
1
vote
0 answers
How does having zero advantage help with identifiability?
I am reading the D3QN paper and they have the following paragraph -
Equation (7) is unidentifiable in the sense that given $Q$ we cannot recover $V$ and $A$ uniquely. To see this, add a constant to $V(s; \theta, \beta)$ and subtract the same…

desert_ranger
- 586
- 3
- 19
1
vote
1 answer
Why do we need to have two heads in D3QN to obtain value and advantage separately, if V is the average of Q values?
I have two questions on the Dueling DQN paper. First, I have an issue on understanding the identifiability that Dueling DQN paper mentions:
Here is my question: If we have given Q-values $Q(s, a; \theta)$ for all actions, I assume we can get value…

Afshin Oroojlooy
- 175
- 1
- 7