What is Wasserstein loss and why is it important in generative models?
Updated May 15, 2026
Short answer
Wasserstein loss measures the optimal transport distance between two probability distributions, typically the real data distribution and the distribution produced by the generator.
Deep explanation
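Wasserstein distance (also called Earth Mover's distance) is the minimum cost required to transform one probability distribution into another, where cost is the amount of probability mass moved multiplied by the distance it travels. In generative models such as Wasserstein GANs (WGANs), it replaces divergence-based objectives such as KL or Jensen-Shannon divergence. In practice, a critic network constrained to be approximately 1-Lipschitz estimates the distance as the difference between its mean scores on real and generated samples. Because the distance keeps changing as the two distributions move closer together, it gives the generator informative gradients even when the distributions do not overlap, a case in which Jensen-Shannon divergence saturates at a constant. This mitigates a key source of instability in GAN training.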
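The non-overlap point can be checked numerically. The following is a minimal sketch, not a WGAN implementation: it assumes one-dimensional samples of equal size, for which the Wasserstein-1 distance reduces to the mean absolute difference between sorted samples. The helper names wasserstein_1d and js_divergence, the bin layout, and the shift values are illustrative choices, not taken from any library.

```python
import numpy as np


def wasserstein_1d(x, y):
    """Wasserstein-1 distance between two equal-size 1-D samples.

    In one dimension the optimal transport plan matches sorted points,
    so the distance is the mean absolute gap between sorted samples.
    """
    x = np.sort(np.asarray(x, dtype=float))
    y = np.sort(np.asarray(y, dtype=float))
    if x.shape != y.shape:
        raise ValueError("this sketch assumes equal sample sizes")
    return float(np.mean(np.abs(x - y)))


def js_divergence(p, q):
    """Jensen-Shannon divergence (natural log) between two histograms."""
    p = np.asarray(p, dtype=float) / np.sum(p)
    q = np.asarray(q, dtype=float) / np.sum(q)
    m = 0.5 * (p + q)

    def kl(a, b):
        mask = a > 0  # terms with a == 0 contribute nothing
        return float(np.sum(a[mask] * np.log(a[mask] / b[mask])))

    return 0.5 * kl(p, m) + 0.5 * kl(q, m)


rng = np.random.default_rng(0)
real = rng.uniform(0.0, 1.0, size=2000)   # stand-in for "real" data
bins = np.linspace(0.0, 12.0, 121)        # shared histogram bins for JS

for shift in (2.0, 5.0, 10.0):
    fake = real + shift                   # support is disjoint from real
    p, _ = np.histogram(real, bins=bins)
    q, _ = np.histogram(fake, bins=bins)
    print(f"shift={shift:5.1f}  "
          f"W1={wasserstein_1d(real, fake):.4f}  "
          f"JS={js_divergence(p, q):.4f}")
```

Under these assumptions the Wasserstein value tracks the shift, while the Jensen-Shannon value stays at log 2 for every shift, because the histograms never overlap. A loss that does not change as the generated samples move closer gives the generator no direction to improve in, which is the failure mode the Wasserstein formulation is meant to avoid.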
Real-world example
No real-world example available yet.
Common mistakes
No common mistakes listed yet.
Follow-up questions
No follow-up questions available yet.