Questions tagged [greedy-policy]

For questions about greedy policies in the context of reinforcement learning and other AI sub-fields.
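As a rough illustration of what the tag covers: a greedy policy picks, in each state, the action with the highest estimated value. The sketch below assumes a tabular setting where `q_table[state][action]` holds the estimate; the table contents and the name `greedy_action` are hypothetical and not taken from any question on this page.

```python
def greedy_action(q_table, state):
    """Return the index of the highest-valued action in `state`.

    Ties are broken in favour of the lowest action index, which is
    an arbitrary choice made for this sketch.
    """
    row = q_table[state]
    return max(range(len(row)), key=row.__getitem__)


# Hypothetical table with 2 states and 3 actions.
q_table = [
    [0.1, 0.5, 0.2],
    [0.7, 0.3, 0.0],
]

print(greedy_action(q_table, 0))
print(greedy_action(q_table, 1))
```

An epsilon-greedy policy would deviate from this choice with some small probability; the purely greedy version never explores.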

2 questions

1 answer

Are Q-learning and SARSA the same when action selection is greedy?

I'm currently studying reinforcement learning and I'm having difficulty with Exercise 6.12 in Sutton and Barto's book. Suppose action selection is greedy. Is Q-learning then exactly the same algorithm as SARSA? Will they make exactly the same…

asked May 10 '20 at 10:52

hyuj

1 answer

Why are Q values updated according to the greedy policy?

Apparently, in the Q-learning algorithm, the Q values are not updated according to the "current policy", but according to a "greedy policy". Why is that the case? I think this is related to the fact that Q-learning is off-policy, but I am also not…

reinforcement-learning q-learning off-policy-methods greedy-policy

asked Nov 17 '18 at 16:23

Shifat E Arman
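
Both questions listed above turn on how the one-step update target is formed. The following is a minimal sketch, assuming a tabular `q[state][action]` estimate and a discount factor `gamma`. The function names and the numbers are hypothetical, and the sketch compares single-step targets only, so it does not by itself settle whether the two algorithms behave identically over a whole run.

```python
def q_learning_target(q, reward, next_state, gamma):
    """Off-policy target: bootstrap from the greedy (max-valued) next action."""
    return reward + gamma * max(q[next_state])


def sarsa_target(q, reward, next_state, next_action, gamma):
    """On-policy target: bootstrap from the next action actually selected."""
    return reward + gamma * q[next_state][next_action]


# Hypothetical values: 2 states, 2 actions.
q = [
    [0.0, 0.0],
    [1.0, 3.0],
]
reward, gamma, next_state = 1.0, 0.5, 1

# Next action is exploratory (action 0 is not the greedy one in state 1).
print(q_learning_target(q, reward, next_state, gamma))
print(sarsa_target(q, reward, next_state, 0, gamma))

# Next action is the greedy one (action 1).
print(sarsa_target(q, reward, next_state, 1, gamma))
```

With these numbers the two targets differ when the selected next action is not the greedy one and coincide when it is, which is the sense in which Q-learning updates "according to a greedy policy" regardless of the policy that generates the behaviour.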