Variational Autoencoder

Model

KL weight β 1.0

Latent noise σ 0.5

Classes 5

Interpolation t 0.5

Metrics

ELBO—

Recon loss—

KL divergence—

Overlap score—

A VAE (Kingma & Welling 2013) learns a structured latent space by maximizing the ELBO = E[log p(x|z)] - β·KL(q(z|x)||p(z)). The KL term regularizes the posterior toward N(0,I). Higher β (β-VAE) encourages disentanglement — latent dimensions correspond to independent factors. Interpolation in latent space produces semantically smooth transitions.