In model-based reinforcement learning algorithms such as Dyna and Prioritized Sweeping, a model of the environment is constructed so that samples are used more efficiently. Separately, eligibility traces help an agent learn (action) value functions faster.
Is it possible to combine learning, planning, and eligibility traces in one algorithm to increase its convergence rate? If yes, how can eligibility traces be used in the planning part, e.g. in Prioritized Sweeping?
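
To make the question concrete, here is a minimal sketch (assuming a tabular setting with a deterministic model, and using a naive Q(λ)-style update that does not cut traces on exploratory actions) of what I mean by combining the three. Traces are applied only to the direct updates from real experience, while planning still performs one-step simulated updates; whether and how traces could also be applied to the out-of-order planning updates is exactly what I am asking about. All names here (`n_states`, `model`, `n_planning`, etc.) are my own illustrative choices, not from any library.

```python
import random
import numpy as np

n_states, n_actions = 10, 2
alpha, gamma, lam, epsilon, n_planning = 0.1, 0.95, 0.9, 0.1, 5

Q = np.zeros((n_states, n_actions))
E = np.zeros((n_states, n_actions))  # eligibility traces; reset to 0 at each episode start
model = {}                           # (s, a) -> (r, s'), a deterministic one-step model


def epsilon_greedy(s):
    if random.random() < epsilon:
        return random.randrange(n_actions)
    return int(np.argmax(Q[s]))


def step(s, a, r, s_next):
    # Direct RL on real experience, with replacing eligibility traces.
    delta = r + gamma * np.max(Q[s_next]) - Q[s, a]
    E[s, a] = 1.0
    Q[:] += alpha * delta * E
    E[:] *= gamma * lam

    # Model learning: remember the last observed outcome of (s, a).
    model[(s, a)] = (r, s_next)

    # Planning: one-step updates on simulated transitions. No traces here --
    # since these updates visit (s, a) pairs in arbitrary order, it is unclear
    # to me how traces would even be defined for them.
    for _ in range(n_planning):
        (ps, pa), (pr, ps_next) = random.choice(list(model.items()))
        Q[ps, pa] += alpha * (pr + gamma * np.max(Q[ps_next]) - Q[ps, pa])
```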