The Spacetime of Diffusion Models: An Information Geometry Perspective
Rafal Karczewski, Markus Heinonen, Alison Pouplin, Søren Hauberg, Vikas K Garg
Abstract
We present a novel geometric perspective on the latent space of diffusion models. We first show that the standard pullback approach, utilizing the deterministic probability flow ODE decoder, is fundamentally flawed. It provably forces geodesics to decode as straight segments in data space, effectively ignoring any intrinsic data geometry beyond the ambient Euclidean space. Complementing this view, diffusion also admits a stochastic decoder via the reverse SDE, which enables an information geometric treatment with the Fisher-Rao metric. However, a choice of $\mathbf{x}_T$ as the latent representation collapses this metric due to memorylessness. We address this by introducing a latent spacetime $\mathbf{z}=(\mathbf{x}_t,t)$ that indexes the family of denoising distributions $p(\mathbf{x}_0 | \mathbf{x}_t)$ across all noise scales, yielding a nontrivial geometric structure. We prove these distributions form an exponential family and derive simulation-free estimators for curve lengths, enabling efficient geodesic computation. The resulting structure induces a principled Diffusion Edit Distance, where geodesics trace minimal sequences of noise and denoise edits between data. We also demonstrate benefits for transition path sampling in molecular systems, including constrained variants such as low-variance transitions and region avoidance. Code is available at: https://github.com/rafalkarczewski/spacetime-geometry.
A spacetime perspective views diffusion latent spaces as Fisher-Rao metric manifolds, enabling efficient geodesic computation without simulation.
- Introduces latent spacetime representation (x_t, t) indexing family of denoising distributions across noise scales
- Proves denoising distributions form exponential family enabling tractable geodesic estimation
- Derives simulation-free estimators for curve lengths enabling efficient geodesic computation in high dimensions
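To make the Fisher-Rao curve-length idea concrete, here is a minimal sketch for the simplest case: a discretized curve through univariate Gaussians, where the Fisher-Rao metric has the closed form $ds^2 = (d\mu^2 + 2\,d\sigma^2)/\sigma^2$. This is not the paper's simulation-free estimator for denoising distributions; the function `fisher_rao_length` is a hypothetical helper used purely for illustration.

```python
import numpy as np

def fisher_rao_length(mus, sigmas):
    """Discrete Fisher-Rao length of a curve through 1-D Gaussians N(mu, sigma^2).

    Uses the closed-form metric ds^2 = (dmu^2 + 2 dsigma^2) / sigma^2 and
    sums segment lengths along the discretized curve (midpoint rule for sigma).
    """
    mus = np.asarray(mus, dtype=float)
    sigmas = np.asarray(sigmas, dtype=float)
    dmu = np.diff(mus)
    dsig = np.diff(sigmas)
    sig_mid = 0.5 * (sigmas[:-1] + sigmas[1:])  # midpoint of sigma per segment
    return float(np.sum(np.sqrt(dmu**2 + 2.0 * dsig**2) / sig_mid))

# Translating the mean from 0 to 1 at fixed sigma = 1 has length exactly 1,
# since ds reduces to |dmu| / sigma along this curve.
t = np.linspace(0.0, 1.0, 200)
length = fisher_rao_length(t, np.ones_like(t))  # -> 1.0
```

Note how shrinking $\sigma$ inflates distances: as the distributions sharpen toward Dirac deltas, the same change in $\mu$ costs arbitrarily more length, which mirrors the numerical instability between nearly clean samples discussed in the paper's limitations.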
- Information geometry
- Fisher-Rao metric
- Optimal transport
- Exponential families
- Molecular systems
- Image datasets
- Optimizing between nearly clean samples is numerically unstable because the denoising distributions collapse to Dirac deltas, yielding effectively infinite distances (from the paper)
- The proposed Diffusion Edit Distance is considerably slower than established similarity metrics such as LPIPS and SSIM (from the paper)
- Future direction: explore a distillation strategy that trains a separate model to predict the Diffusion Edit Distance (from the paper)
Author keywords
- diffusion models
- information geometry
Related orals
Universal Inverse Distillation for Matching Models with Real-Data Supervision (No GANs)
RealUID provides universal distillation for matching models without GANs, incorporating real data into one-step generator training.
GLASS Flows: Efficient Inference for Reward Alignment of Flow and Diffusion Models
GLASS Flows samples Markov transitions via inner flow matching models to improve inference-time reward alignment in flow and diffusion models.
Neon: Negative Extrapolation From Self-Training Improves Image Generation
Neon inverts model degradation from self-training by extrapolating away from it, improving generative models with minimal compute.
Generative Human Geometry Distribution
Introduces distribution-over-distribution model combining geometry distributions with two-stage flow matching for human 3D generation.
Cross-Domain Lossy Compression via Rate- and Classification-Constrained Optimal Transport
Cross-domain lossy compression unifies rate and classification constraints via optimal transport framework.