Skip to content

Generative Modeling Overview: a Probabilistic Perspective

Goals and Traits of Generative Models

  • Density Estimation: pointwise evaluation of pθ(x).
  • Data Generation: generating new samples xpθ(x).
    • Conditional Generation: xpθ(x|c) for some condition c.
    • Imputation: xmpθ(xm|xo) for some partially observed data xo.
  • Training Target: the gradient of the loss θL(θ) should be tractable.
    • Corresponding Metrics: (Forward) KL-divergence DKL(pdata||pθ), etc.
  • Latents: whether latent variables z are introduced, potentially enabling latent interpolation and arithmetics.
  • Architecture: whether there are architectural restrictions on the neural network when modeling pθ(x).

Note

  • Supporting fast pointwise evaluation of pθ(x) does not necessarily allow fast sampling xpθ(x), and vice versa.
  • Tractable training target L(θ) does not necessarily allow fast pointwise evaluation of pθ(x) or fast sampling xpθ(x).
  • Likelihood (MLE) is often uncorrelated with the perceptual quality of the samples (images, sound, etc.) 1

Discussion

How can we parameterize pθ(x) for high-dimensional input, while allowing us to sample from it?

Considerations:

  • Can we perform pointwise evaluation of pθ(x)?
  • (Required) Can we sample from pθ(x)?
  • (Required) Can we efficiently compute the gradient of the loss?
  • Should we introduce a latent variable z?
  • How can we design the model architecture to satisfy the restrictions?

Taxonomy of Generative Models

Overview of different types of generative models, from Fig.1 of What are Diffusion Models? by Lilian Weng, and from Fig.20.1 of Probabilistic Machine Learning: Advanced Topics.

Characteristics of common kinds of generative model, modified from Table 20.1 of Probabilistic Machine Learning: Advanced Topics.
Model Density Sampling Training Latents Architecture
VAE LB, fast Fast MLE-LB RL Encoder-Decoder
ARM Exact, fast Slow MLE None Sequential
Flows Exact, slow/fast Slow MLE RD Invertible
EBM Approx, slow Slow MLE-Approx Optional Discriminative
DM LB Slow MLE-LB RD Encoder-Decoder
GAN N/A Fast Min-max RL Generator-Discriminator

Community Resources


  1. See Section 20.4.1.3 "Likelihood can be hard to compute" from Probabilistic Machine Learning: Advanced Topics 

Comments