I'm trying to train a VAE on a graph dataset, but my latent space shrinks epoch by epoch, while my ELBO plot reaches a steady state after only a few epochs.
I played around with the hyperparameters and noticed that increasing the batch size or the amount of training data accelerates this: the latent space shrinks sooner, and the ELBO reaches its steady state even faster.
Is this a common problem with a general solution?
Given these symptoms, which part of the algorithm is most likely to be causing the issue? Is it a problem in how the loss function is computed? Does it look like the decoder is not trained well? Or is it more likely that the encoder has not learned features that are informative enough?
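For reference, by ELBO I mean the standard variational objective with a reconstruction term and a KL term:

$$
\mathcal{L}(\theta,\phi;x) \;=\; \underbrace{\mathbb{E}_{q_\phi(z\mid x)}\big[\log p_\theta(x\mid z)\big]}_{\text{reconstruction}} \;-\; \underbrace{D_{\mathrm{KL}}\big(q_\phi(z\mid x)\,\big\|\,p(z)\big)}_{\text{KL regularizer}}
$$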
Edit:
I figured out that the problem is most likely caused by the loss function. My loss is a combination of a KL term and a reconstruction term. On the GitHub page for graph auto-encoders, it is suggested that the loss function should include normalization factors that depend on the number of nodes in the graph. I haven't worked out the exact formulation, but after weighting my reconstruction loss by a factor of 100 and my KL loss by a factor of 0.5, the algorithm works fine. I would appreciate it if someone could explain how these normalization factors are actually supposed to be set up.
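In case it helps, here is a minimal PyTorch sketch of the normalization I believe that repo (tkipf/gae, which is written in TensorFlow) uses; the function name `gvae_loss` and its arguments are my own, so treat the details as my reading of it rather than the repo's exact code:

```python
import torch
import torch.nn.functional as F

def gvae_loss(logits, adj_label, mu, logvar, num_nodes):
    """Sketch of a graph-VAE loss with size-dependent normalization.

    logits:     (N, N) raw scores from the inner-product decoder
    adj_label:  (N, N) binary adjacency target, as a float tensor
    mu, logvar: (N, latent_dim) outputs of the encoder
    """
    n_total = float(num_nodes * num_nodes)
    n_edges = adj_label.sum()

    # Sparse graphs have far more non-edges than edges, so positive
    # entries are up-weighted in the cross-entropy.
    pos_weight = (n_total - n_edges) / n_edges
    # Rescales the reconstruction term so it stays comparable across
    # graphs of different sizes and densities.
    norm = n_total / (2.0 * (n_total - n_edges))

    recon = norm * F.binary_cross_entropy_with_logits(
        logits, adj_label, pos_weight=pos_weight
    )

    # KL divergence of q(z|x) from the standard normal prior,
    # averaged over nodes; the 1/num_nodes factor is, as far as I can
    # tell, the node-count normalization the repo recommends.
    kl = (-0.5 / num_nodes) * torch.mean(
        torch.sum(1 + logvar - mu.pow(2) - logvar.exp(), dim=1)
    )
    return recon + kl
```

My guess is that my hand-tuned factors of 100 and 0.5 were roughly approximating the `norm` and `1/num_nodes` factors above, which would explain why the rescaling fixed the training.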