Is there a variant of Thompson sampling (TS) that lets us introduce an extra arm while the algorithm is already running on a multi-armed bandit problem and estimating the arms' returns?

For instance, while applying TS to 3 slot machines, we learn that a fourth slot machine exists, and we'd like the algorithm to take that machine into account from then on.
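To make the scenario concrete, here is a minimal sketch of what I have in mind, assuming Bernoulli rewards with Beta posteriors. The class `BernoulliThompsonSampler`, its `add_arm` method, the uniform Beta(1, 1) prior for the new machine, and the payout probabilities are all my own illustrative assumptions, not an established variant from the literature.

```python
import random


class BernoulliThompsonSampler:
    """Beta-Bernoulli Thompson sampling with a hypothetical add_arm() hook."""

    def __init__(self, n_arms, seed=None):
        self.rng = random.Random(seed)
        # One Beta(alpha, beta) posterior per arm, starting from Beta(1, 1).
        self.alpha = [1.0] * n_arms
        self.beta = [1.0] * n_arms

    def add_arm(self, prior_alpha=1.0, prior_beta=1.0):
        """Register a newly discovered arm with its own prior (an assumption)."""
        self.alpha.append(prior_alpha)
        self.beta.append(prior_beta)
        return len(self.alpha) - 1  # index of the new arm

    def select_arm(self):
        # Draw one sample from each posterior and play the arm with the largest draw.
        samples = [self.rng.betavariate(a, b) for a, b in zip(self.alpha, self.beta)]
        return max(range(len(samples)), key=samples.__getitem__)

    def update(self, arm, reward):
        # A success increments alpha, a failure increments beta.
        if reward:
            self.alpha[arm] += 1.0
        else:
            self.beta[arm] += 1.0


def run(sampler, true_probs, rounds, env_rng):
    """Play `rounds` pulls against simulated machines and return pull counts."""
    pulls = [0] * len(true_probs)
    for _ in range(rounds):
        arm = sampler.select_arm()
        reward = env_rng.random() < true_probs[arm]
        sampler.update(arm, reward)
        pulls[arm] += 1
    return pulls


env_rng = random.Random(0)
sampler = BernoulliThompsonSampler(n_arms=3, seed=1)

# Phase 1: only three machines are known (made-up payout probabilities).
true_probs = [0.2, 0.5, 0.4]
pulls_before = run(sampler, true_probs, 500, env_rng)

# Phase 2: a fourth machine is discovered and added with a fresh prior.
new_arm = sampler.add_arm()
true_probs.append(0.8)
pulls_after = run(sampler, true_probs, 2000, env_rng)

print("pulls before adding the arm:", pulls_before)
print("pulls after adding the arm: ", pulls_after)
```

Here the existing posteriors are left untouched and the new machine simply starts from its prior. Whether this naive approach is sound, or whether a more principled variant exists, is what I am asking.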

nbro
desert_ranger

0 Answers