1

In this Stanford lecture (minute 35:47 and 37:00), the professor says that Monte Carlo (MC) linear function approximation does not always converge, and she gives an example. In general, when does MC linear function approximation converge (or not)?

Why do people use that MC linear function approximation if sometimes it doesn't converge?

They also gave the definition of the stationary distribution of a policy, and I am not sure if using it for function approximation converges or not.

nbro
  • 39,006
  • 12
  • 98
  • 176
dato nefaridze
  • 862
  • 6
  • 20
  • I don't think your question is a duplicate (because you're asking about Monte Carlo approximations), but here's a related question [Why doesn't Q-learning converge when using function approximation?](https://ai.stackexchange.com/q/11679/2444). – nbro Apr 30 '20 at 14:27

0 Answers0