I often see Thompson Sampling in the RL literature, but I am not able to relate it to any of the current RL techniques. How exactly does it fit into RL?

desert_ranger

1 Answer

Thompson Sampling (TS) is used in the context of bandits, which is a special case of the RL problem.
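To make the bandit case concrete, here is a minimal sketch of TS on a Bernoulli bandit, assuming a uniform Beta(1, 1) prior on each arm's success probability. The function name `thompson_sampling` and the parameters `true_probs`, `n_rounds` and `seed` are my own illustrative choices and do not come from any particular library.

```python
import random


def thompson_sampling(true_probs, n_rounds, seed=0):
    """Run Beta-Bernoulli Thompson Sampling on a simulated bandit.

    true_probs: hypothetical success probability of each arm (unknown to the agent).
    Returns the number of pulls, successes and failures per arm.
    """
    rng = random.Random(seed)
    n_arms = len(true_probs)
    successes = [0] * n_arms
    failures = [0] * n_arms
    pulls = [0] * n_arms

    for _ in range(n_rounds):
        # Draw one sample per arm from its posterior Beta(successes + 1, failures + 1).
        samples = [
            rng.betavariate(successes[a] + 1, failures[a] + 1)
            for a in range(n_arms)
        ]
        # Act greedily with respect to the sampled values.
        arm = max(range(n_arms), key=lambda a: samples[a])

        # Simulate a Bernoulli reward from the chosen arm.
        reward = 1 if rng.random() < true_probs[arm] else 0

        # Update the posterior counts of the chosen arm only.
        pulls[arm] += 1
        if reward:
            successes[arm] += 1
        else:
            failures[arm] += 1

    return pulls, successes, failures


pulls, successes, failures = thompson_sampling([0.2, 0.5, 0.8], n_rounds=1000)
print(pulls)
```

Exploration here comes only from the randomness of the posterior samples: arms with few pulls have wide posteriors, so they are still chosen occasionally, while arms that look clearly worse are pulled less and less.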

You can also use TS for the full RL problem, but that can lead to inefficient exploration. To learn more about this issue, you could read

nbro