What is the sample complexity of Monte Carlo Exploring Starts in RL?

Asked Jul 13 '21 at 04:33

Active Jul 13 '21 at 10:59

Viewed 69 times

We can use a model-free Monte Carlo approach to solving an MDP $(S,A,R,P,\gamma)$ with transition dynamics $P$ unknown by estimating Q-values by rolling out trajectories starting from random states $s_0 \in S$ and improving the policy $\pi$ greedily. This is the Monte Carlo Exploring Starts algorithm in Sutton and Barto page 99 2nd edition.

Does anyone know if there is a sample complexity result for this algorithm?

edited Jul 13 '21 at 10:59

nbro

39,006
12
98
176

asked Jul 13 '21 at 04:33

Snowball

What is the sample complexity of Monte Carlo Exploring Starts in RL?

0 Answers0