0

Suppose we've got the following formula:

$\log p(x;\theta)=\mathbb{E}_{q(z|x;\phi)}[\log p(x,z;\theta)-\log q(z|x;\phi)]+KL(q(z|x;\phi)||p(z|x;\theta))\\ \geq \mathbb{E}_{q(z|x;\phi)}[\log p(x,z;\theta)-\log q(z|x;\phi)]$

We call this Evidence Lower Bound (ELBO):

$\mathrm{ELBO}=\mathbb{E}_{q(z|x;\phi)}[\log p(x,z;\theta)-\log q(z|x;\phi)]$

We use the method of maximizing ELBO to optimize the parameters $\theta$ and $\phi$:

$\max_{\theta,\phi}\mathbb{E}_{\mathcal{D}}\mathbb{E_{q(z|x;\phi)}}[\log p(x,z;\theta)-\log q(z|x;\phi)]$

We can intuitively understand that: for given $\theta$, the upper bound of ELBO is $\log p(x;\theta)$, so optimizing $\phi$ allows ELBO to be as close as $\log p(x;\theta)$ possible; Then, optimizing $\theta$ can make the data likelihood as large as possible, which achieves the goal of MLE. (just like what shown in Fig) enter image description here

The ELBO mentioned above is also the loss function of VAE. (An algebraic transformation of the ELBO mentioned above: $\mathrm{ELBO}=\mathbb{E}_{q(z|x;\phi)}[\log p(x|z;\theta)]-KL(q(z|x;\phi)||p(z;\theta))\\\\$)

My question is:

1.When maximizing ELBO, how can I fix $\theta$ and then $\phi$, as mentioned above?

2.VAE optimizes them at the same time, there is no precedence. How could I understand the difference?

Garfield
  • 11
  • 1
  • I don't understand your question: "How could I understand the difference?". What difference are you referring to? Can you please also put your actual question in the title? Your question is not how to optimize the ELBO, apparently, if I understand correctly your first question in the body. – nbro Dec 30 '22 at 20:24
  • Thank you for your attention. When applying EM algorithm (Expectation Maximum) to optimize ELBO, we usually fix $\theta$ first and then $\phi$. So I wonder if I can apply to VI(Variational Inference) as well? The difference i refer to is that one(EM/VI) fixes $\theta$ so that ELBO approaches logp(x), and then optimizes $\theta$ to increase ELBO so that logp(x) increases; The other(VAE) is to maximize ELBO without regard to fixation and sequencing. – Garfield Dec 31 '22 at 02:01
  • I have described the problem more clearly(https://ai.stackexchange.com/questions/38537/is-vae-the-same-as-the-e-step-of-the-em-algorithm) and hope to get your help! – Garfield Dec 31 '22 at 06:42

0 Answers0