Generative Modeling Overview: A Probabilistic Perspective

Goals and Traits of Generative Models

Density Estimation: pointwise evaluation of $p_{θ}(x)$.
Data Generation: generating new samples $x \sim p_{θ}(x)$.
- Conditional Generation: $x \sim p_{θ}(x | c)$ for some condition $c$.
- Imputation: $x_{m} \sim p_{θ}(x_{m} | x_{o})$ for the missing part $x_{m}$ given partially observed data $x_{o}$.
Training Target: the gradient of the loss $\nabla_{θ} L(θ)$ should be tractable.
- Corresponding Metrics: (Forward) KL divergence $D_{KL}(p_{\text{data}} \,\|\, p_{θ})$, etc.
Latents: whether latent variables $z$ are introduced, potentially enabling latent interpolation and arithmetic.
Architecture: whether there are architectural restrictions on the neural network when modeling $p_{θ}(x)$.
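
The connection between the forward KL divergence and maximum likelihood can be made explicit with a short derivation. The entropy $H(p_{\text{data}})$ and the i.i.d. training samples $x_{1}, \dots, x_{N}$ are notation introduced here for illustration only.

$$
\begin{aligned}
D_{KL}(p_{\text{data}} \,\|\, p_{θ})
&= \mathbb{E}_{x \sim p_{\text{data}}}\left[\log p_{\text{data}}(x) - \log p_{θ}(x)\right] \\
&= -H(p_{\text{data}}) - \mathbb{E}_{x \sim p_{\text{data}}}\left[\log p_{θ}(x)\right]
\end{aligned}
$$

Since $H(p_{\text{data}})$ does not depend on $θ$, minimizing the forward KL divergence is equivalent to maximizing the expected log-likelihood, which in practice is approximated by the sample average:

$$
\arg\min_{θ} D_{KL}(p_{\text{data}} \,\|\, p_{θ})
= \arg\max_{θ} \mathbb{E}_{x \sim p_{\text{data}}}\left[\log p_{θ}(x)\right]
\approx \arg\max_{θ} \frac{1}{N} \sum_{n=1}^{N} \log p_{θ}(x_{n})
$$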

Note

Fast pointwise evaluation of $p_{θ}(x)$ does not necessarily imply fast sampling $x \sim p_{θ}(x)$, and vice versa.
A tractable training target $L(θ)$ does not necessarily allow fast pointwise evaluation of $p_{θ}(x)$ or fast sampling $x \sim p_{θ}(x)$.
Likelihood (the quantity maximized by MLE) is often uncorrelated with the perceptual quality of the samples (images, sound, etc.).¹
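
The gap between fast evaluation and fast sampling can be seen in a toy autoregressive model. The sketch below assumes a linear-Gaussian model with unit-variance conditionals; the names `make_weights`, `log_prob`, and `sample` are hypothetical and chosen only for illustration. Evaluating $\log p(x)$ takes a single vectorized pass because all of $x$ is observed, while sampling has to visit the coordinates one at a time.

```python
import numpy as np

def make_weights(dim, seed=0):
    """Strictly lower-triangular weights, so x[i] depends only on x[:i]."""
    rng = np.random.default_rng(seed)
    return np.tril(rng.normal(scale=0.5, size=(dim, dim)), k=-1)

def log_prob(x, weights):
    """Evaluate log p(x) in one vectorized pass.

    Every conditional mean is available at once because x is fully observed.
    """
    means = weights @ x                      # mean of x[i] given x[:i]
    resid = x - means
    return float(-0.5 * np.sum(resid**2) - 0.5 * x.size * np.log(2 * np.pi))

def sample(weights, rng):
    """Draw x ~ p(x) coordinate by coordinate.

    x[i] cannot be drawn before x[:i] exists, so this needs dim sequential steps.
    """
    dim = weights.shape[0]
    x = np.zeros(dim)
    for i in range(dim):
        mean_i = weights[i, :i] @ x[:i]      # empty dot product is 0.0 for i = 0
        x[i] = mean_i + rng.normal()
    return x

weights = make_weights(dim=5)
x = sample(weights, np.random.default_rng(1))
print(log_prob(x, weights))
```

In a neural autoregressive model the same asymmetry holds: the loop in `sample` becomes one network call per dimension, whereas `log_prob` remains a single call.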

How can we parameterize $p_{θ} (x)$ for high-dimensional input, while allowing us to sample from it?

Considerations:

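Characteristics of common kinds of generative models, modified from Table 20.1 of Probabilistic Machine Learning: Advanced Topics. Abbreviations: LB = lower bound, ARM = autoregressive model, EBM = energy-based model, DM = diffusion model. $D$ is the dimensionality of the data $x$ and $L$ is the dimensionality of the latent $z$.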

Model	Density	Sampling	Training	Latents	Architecture
VAE	LB, fast	Fast	MLE-LB	$\mathbb{R}^{L}$	Encoder-Decoder
ARM	Exact, fast	Slow	MLE	None	Sequential
Flows	Exact, slow/fast	Slow	MLE	$\mathbb{R}^{D}$	Invertible
EBM	Approx, slow	Slow	MLE-Approx	Optional	Discriminative
DM	LB	Slow	MLE-LB	$\mathbb{R}^{D}$	Encoder-Decoder
GAN	N/A	Fast	Min-max	$\mathbb{R}^{L}$	Generator-Discriminator
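
As one concrete reading of the "Flows" row, the sketch below assumes the simplest possible invertible map, an elementwise affine transform $x = s \odot z + t$ with a standard normal base distribution. The function names are hypothetical, and real flows stack many richer invertible layers. It shows why an invertible architecture yields an exact density: the change-of-variables formula only needs the inverse map and the log-determinant of its Jacobian.

```python
import numpy as np

def forward(z, scale, shift):
    """Map latent z to data x; invertible as long as every scale entry is nonzero."""
    return scale * z + shift

def inverse(x, scale, shift):
    """Map data x back to its latent z."""
    return (x - shift) / scale

def flow_log_prob(x, scale, shift):
    """Exact log p(x) = log N(z; 0, I) - sum(log|scale|), with z = inverse(x)."""
    z = inverse(x, scale, shift)
    log_base = -0.5 * np.sum(z**2) - 0.5 * z.size * np.log(2 * np.pi)
    return float(log_base - np.sum(np.log(np.abs(scale))))

def flow_sample(scale, shift, rng):
    """Sample by pushing base noise through the forward map."""
    return forward(rng.standard_normal(scale.shape), scale, shift)

scale = np.array([2.0, 0.5])
shift = np.array([1.0, -1.0])
x = flow_sample(scale, shift, np.random.default_rng(0))
print(flow_log_prob(x, scale, shift))
```

Note that the latent $z$ has the same dimensionality as $x$, matching the $\mathbb{R}^{D}$ entry in the table.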

¹ See Section 20.4.1.3 "Likelihood can be hard to compute" from Probabilistic Machine Learning: Advanced Topics.