What is Wasserstein loss and why is it important in generative models?

Updated May 15, 2026

Short answer

Wasserstein loss measures optimal transport distance between probability distributions.

Deep explanation

Wasserstein distance defines the minimum cost required to transform one probability distribution into another. In generative models like Wasserstein GANs, this replaces divergence-based losses such as KL or Jensen-Shannon divergence. It provides smoother gradients even when distributions do not overlap, solving a key instability problem in GAN training.

Unlock with a Pro subscription to view this section.

View pricing