1

I couldn't understand the wording here.

Training for the comparisons

What does "shuffle the comparisons into one dataset" mean?

How does the method they use don't have $K \choose 2$ forward passes for K completions? Do they update $K \choose 2$ in an epoch for K completions or what?

nbro
  • 39,006
  • 12
  • 98
  • 176

0 Answers0