For questions about greedy policies in the context of reinforcement learning and other AI sub-fields.
Questions tagged [greedy-policy]
2 questions
11 votes, 1 answer
Are Q-learning and SARSA the same when action selection is greedy?
I'm currently studying reinforcement learning and I'm having difficulties with question 6.12 in Sutton and Barto's book.
Suppose action selection is greedy. Is Q-learning then exactly the same algorithm as SARSA? Will they make exactly the same…

hyuj
3 votes, 1 answer
Why are Q values updated according to the greedy policy?
Apparently, in the Q-learning algorithm, the Q values are not updated according to the "current policy", but according to a "greedy policy". Why is that the case? I think this is related to the fact that Q-learning is off-policy, but I am also not…

Shifat E Arman
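
For context on the two questions listed above, here is a minimal sketch of the one-step update targets they discuss. It is an illustration only, not taken from either question or its answers. The function names, the dictionary-based Q table, the state and action labels, and the default `gamma=0.9` are all assumptions made for this example.

```python
import random


def q_learning_target(Q, reward, next_state, actions, gamma=0.9):
    """Q-learning target: bootstrap from the highest-valued (greedy) action
    in next_state, regardless of which action the agent takes next."""
    return reward + gamma * max(Q[(next_state, a)] for a in actions)


def sarsa_target(Q, reward, next_state, next_action, gamma=0.9):
    """SARSA target: bootstrap from the action the behaviour policy
    actually selected in next_state."""
    return reward + gamma * Q[(next_state, next_action)]


def epsilon_greedy(Q, state, actions, epsilon, rng):
    """Hypothetical behaviour policy: explore with probability epsilon,
    otherwise pick the greedy action."""
    if rng.random() < epsilon:
        return rng.choice(actions)
    return max(actions, key=lambda a: Q[(state, a)])


# Toy Q table (assumed values, for illustration only).
Q = {("s1", "left"): 1.0, ("s1", "right"): 2.0}
actions = ["left", "right"]

# The Q-learning target always uses the greedy action ("right" here).
print(q_learning_target(Q, 0.0, "s1", actions))

# The SARSA target depends on the next action actually chosen.
print(sarsa_target(Q, 0.0, "s1", "left"))
print(sarsa_target(Q, 0.0, "s1", "right"))
```

In this sketch the two targets coincide whenever the next action passed to `sarsa_target` is the greedy one. Whether the complete algorithms then behave identically is what the first question asks, and the sketch does not settle it.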