Questions tagged [reparameterization-trick]

For questions about the reparameterization trick, which is used, for example, in variational autoencoders (VAEs).
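As a rough illustration of what the tag covers, here is a minimal sketch that assumes a one-dimensional Gaussian. The function name `reparameterized_sample` and its arguments are hypothetical, chosen only for this example. The idea is to draw the noise `eps` independently of the parameters and then compute the sample as a deterministic function of `mu`, `sigma`, and `eps`.

```python
import random


def reparameterized_sample(mu, sigma, eps=None):
    """Return z = mu + sigma * eps, where eps ~ N(0, 1).

    Hypothetical helper for illustration; not taken from any library.
    """
    if eps is None:
        # The randomness is drawn without reference to mu or sigma.
        eps = random.gauss(0.0, 1.0)
    return mu + sigma * eps


# With eps held fixed, z is a deterministic, differentiable function
# of the parameters: dz/dmu = 1 and dz/dsigma = eps.
eps = 0.5
z = reparameterized_sample(mu=1.0, sigma=2.0, eps=eps)

# Finite-difference check of dz/dsigma (should be close to eps).
h = 1e-6
dz_dsigma = (reparameterized_sample(1.0, 2.0 + h, eps) - z) / h
```

Because the sample is written this way, a gradient with respect to `mu` and `sigma` can pass through `z`, which is not possible if `z` is treated as an opaque draw from the distribution.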

2 questions

1 answer

Is the re-parameterization trick necessary in the policy gradient method?

If we want to learn a stochastic policy with the policy gradient method, we have to sample from the distribution to get an action. Wouldn't this lead to the same issue that variational autoencoders face without the reparameterization trick, where…

asked Apr 04 '23 at 12:45

Sam

1 answer

Why clamp std for reparameterization trick between -20 and 2?

In the Soft Actor-Critic paper (https://arxiv.org/pdf/1801.01290.pdf), a neural network is used to approximate a diagonal Gaussian distribution. In the sample function you can see that it has a function called reparameterize. As you can…

reinforcement-learning soft-actor-critic reparameterization-trick

asked May 03 '23 at 13:24

chadmc
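For context on the second question, a minimal sketch of the pattern it appears to describe. This assumes, as in many public Soft Actor-Critic implementations, that the bounds -20 and 2 are applied to the log of the standard deviation rather than to the standard deviation itself. The names `LOG_STD_MIN`, `LOG_STD_MAX`, `clamp`, and `sample_action` are hypothetical and not taken from the paper or from the code the question refers to.

```python
import math
import random

# Assumed bounds on log(std); a standard deviation cannot be negative,
# so -20 only makes sense in log space.
LOG_STD_MIN, LOG_STD_MAX = -20.0, 2.0


def clamp(x, lo, hi):
    """Restrict x to the closed interval [lo, hi]."""
    return max(lo, min(hi, x))


def sample_action(mean, log_std, eps=None):
    """Reparameterized Gaussian sample with a clamped log standard deviation."""
    log_std = clamp(log_std, LOG_STD_MIN, LOG_STD_MAX)
    std = math.exp(log_std)  # always positive, between exp(-20) and exp(2)
    if eps is None:
        eps = random.gauss(0.0, 1.0)
    return mean + std * eps, std


# An extreme network output is limited to std = exp(2).
action, std = sample_action(mean=0.0, log_std=100.0, eps=1.0)
```

Under this assumption the clamp keeps the standard deviation strictly positive and finite, so `math.exp` neither overflows nor collapses the distribution to a point.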