ICLR 2026 Orals

The Spacetime of Diffusion Models: An Information Geometry Perspective

Rafal Karczewski, Markus Heinonen, Alison Pouplin, Søren Hauberg, Vikas K Garg

Diffusion & Flow Matching · Sat, Apr 25 · 3:27 PM–3:37 PM · 201 A/B · Avg rating: 6.50 (4–8)

Abstract

We present a novel geometric perspective on the latent space of diffusion models. We first show that the standard pullback approach, utilizing the deterministic probability flow ODE decoder, is fundamentally flawed. It provably forces geodesics to decode as straight segments in data space, effectively ignoring any intrinsic data geometry beyond the ambient Euclidean space. Complementing this view, diffusion also admits a stochastic decoder via the reverse SDE, which enables an information geometric treatment with the Fisher-Rao metric. However, a choice of $\mathbf{x}_T$ as the latent representation collapses this metric due to memorylessness. We address this by introducing a latent spacetime $\mathbf{z}=(\mathbf{x}_t,t)$ that indexes the family of denoising distributions $p(\mathbf{x}_0 | \mathbf{x}_t)$ across all noise scales, yielding a nontrivial geometric structure. We prove these distributions form an exponential family and derive simulation-free estimators for curve lengths, enabling efficient geodesic computation. The resulting structure induces a principled Diffusion Edit Distance, where geodesics trace minimal sequences of noise and denoise edits between data. We also demonstrate benefits for transition path sampling in molecular systems, including constrained variants such as low-variance transitions and region avoidance. Code is available at: https://github.com/rafalkarczewski/spacetime-geometry.
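The exponential-family claim can be made concrete with a toy case. The sketch below is an illustration under stated assumptions, not the authors' implementation: it assumes a forward process $\mathbf{x}_t = \alpha_t \mathbf{x}_0 + \sigma_t \boldsymbol{\epsilon}$, for which Bayes' rule gives $p(\mathbf{x}_0 | \mathbf{x}_t) \propto p(\mathbf{x}_0)\exp\left(\frac{\alpha_t}{\sigma_t^2}\mathbf{x}_t^\top\mathbf{x}_0 - \frac{\alpha_t^2}{2\sigma_t^2}\|\mathbf{x}_0\|^2\right)$, an exponential family with sufficient statistics $(\mathbf{x}_0, \|\mathbf{x}_0\|^2)$. It also assumes 1-D data $x_0 \sim \mathcal{N}(0, s^2)$, so that the posterior moments are available in closed form instead of coming from a trained denoiser. The function names (`natural_params`, `expectation_params`, `curve_length`) are hypothetical. The length of a discretized spacetime curve is approximated by summing $\sqrt{\Delta\eta \cdot \Delta\mu}$ over segments, where $\eta$ and $\mu$ are the natural and expectation parameters; no ODE or SDE is simulated.

```python
import math


def natural_params(x_t, alpha, sigma):
    """Natural parameters of p(x0 | x_t), assuming x_t = alpha * x0 + sigma * eps (1-D)."""
    return (alpha * x_t / sigma**2, -(alpha**2) / (2.0 * sigma**2))


def expectation_params(x_t, alpha, sigma, s=1.0):
    """(E[x0 | x_t], E[x0^2 | x_t]) for the toy prior x0 ~ N(0, s^2).

    In a real diffusion model these moments would come from the learned
    denoiser; the closed form here is an assumption of the toy setup.
    """
    var = 1.0 / (1.0 / s**2 + alpha**2 / sigma**2)
    mean = var * alpha * x_t / sigma**2
    return (mean, mean**2 + var)


def curve_length(points, s=1.0):
    """Approximate Fisher-Rao length of a curve of (x_t, alpha, sigma) triples."""
    total = 0.0
    for a, b in zip(points[:-1], points[1:]):
        eta_a, eta_b = natural_params(*a), natural_params(*b)
        mu_a, mu_b = expectation_params(*a, s=s), expectation_params(*b, s=s)
        # For an exponential family, d_eta . d_mu is non-negative and equals
        # the symmetrized KL divergence between the two segment endpoints.
        sq = sum((eb - ea) * (mb - ma)
                 for ea, eb, ma, mb in zip(eta_a, eta_b, mu_a, mu_b))
        total += math.sqrt(max(sq, 0.0))
    return total


# Curve at a fixed noise level (alpha = sigma = 1), moving x_t from -1 to 1.
points = [(-1.0 + 2.0 * k / 100, 1.0, 1.0) for k in range(101)]
length = curve_length(points)
print(length)  # approximately 1.4142, i.e. 2 * sqrt(0.5)
```

At a fixed noise level only the first natural parameter changes, so the toy length reduces to $\frac{\alpha}{\sigma^2}\sqrt{v}\,|\Delta x_t|$ with posterior variance $v = 0.5$, which is what the printed value reflects. Curves that also move in $t$ pick up a contribution from the second pair of parameters.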

One-sentence summary · Auto-generated by claude-haiku-4-5-20251001

A spacetime perspective treats the latent space of diffusion models as a manifold equipped with the Fisher-Rao metric, enabling efficient, simulation-free geodesic computation.

Contributions · Auto-generated by claude-haiku-4-5-20251001
  • Introduces a latent spacetime representation (x_t, t) that indexes the family of denoising distributions across noise scales
  • Proves that the denoising distributions form an exponential family, which enables tractable geodesic estimation
  • Derives simulation-free estimators of curve length, enabling efficient geodesic computation in high dimensions
Methods used · Auto-generated by claude-haiku-4-5-20251001
  • Information geometry
  • Fisher-Rao metric
  • Optimal transport
  • Exponential families
Datasets used · Auto-generated by claude-haiku-4-5-20251001
  • Molecular systems
  • Image datasets
Limitations (author-stated) · Auto-generated by claude-haiku-4-5-20251001
  • Optimization between nearly clean samples is numerically unstable, because the denoising distributions collapse to Dirac deltas and distances become effectively infinite
    from the paper
  • The proposed Diffusion Edit Distance is considerably slower to compute than established similarity metrics such as LPIPS and SSIM
    from the paper
Future work (author-stated) · Auto-generated by claude-haiku-4-5-20251001
  • Explore a distillation strategy that trains a separate model to predict the Diffusion Edit Distance
    from the paper

Author keywords

  • diffusion models
  • information geometry

Related orals

Generative Human Geometry Distribution

Introduces a distribution-over-distribution model that combines geometry distributions with two-stage flow matching for 3D human generation.

Avg rating: 5.50 (2–8) · Xiangjun Tang et al.