How exactly does self-play work, and how does it relate to MCTS?

Asked Feb 09 '20 at 13:05

Active Feb 09 '20 at 14:28

Viewed 155 times

I am working towards using RL to create an AI for a two-player, hidden-information, a turn-based board game. I have just finished David Silver's RL course and Denny Britz's coding exercises, and so am relatively familiar with MC control, SARSA, Q-learning, etc. However, the course was focused on single-player, perfect-information games, and I haven't managed to find any examples similar to the type of game I have, and would like advice on how to proceed.

I am still unsure how self-play works, and how it relates to MCTS. For example, I don't know if this involves using the latest agent to play both sides, or playing an agent against older versions, or training multiple opposing agents simultaneously. Are there good examples (or repositories) for learning self-play and MCTS for two-player games?

edited Feb 09 '20 at 14:28

nbro

39,006
12
98
176

asked Feb 09 '20 at 13:05

Alienator

How exactly does self-play work, and how does it relate to MCTS?

0 Answers0