ICLR 2026 Orals

Discount Model Search for Quality Diversity Optimization in High-Dimensional Measure Spaces

Bryon Tjanaka, Henry Chen, Matthew Christopher Fontaine, Stefanos Nikolaidis

Reinforcement Learning & Agents Sat, Apr 25 · 4:15 PM–4:25 PM · 202 A/B Avg rating: 5.50 (4–8)
Author-provided TL;DR

We present a method that enhances exploration in quality diversity (QD) optimization and show how this method enables new applications for QD. Project page: https://discount-models.github.io/

Abstract

Quality diversity (QD) optimization searches for a collection of solutions that optimize an objective while attaining diverse outputs of a user-specified, vector-valued measure function. Contemporary QD algorithms are typically limited to low-dimensional measures because high-dimensional measures are prone to distortion, where many solutions found by the QD algorithm map to similar measures. For example, the state-of-the-art CMA-MAE algorithm guides measure space exploration with a histogram in measure space that records so-called discount values. However, CMA-MAE stagnates in domains with high-dimensional measure spaces because solutions with similar measures fall into the same histogram cell and hence receive the same discount value. To address these limitations, we propose Discount Model Search (DMS), which guides exploration with a model that provides a smooth, continuous representation of discount values. In high-dimensional measure spaces, this model enables DMS to distinguish between solutions with similar measures and thus continue exploration. We show that DMS facilitates new capabilities for QD algorithms by introducing two new domains where the measure space is the high-dimensional space of images, which enables users to specify their desired measures by providing a dataset of images rather than hand-designing the measure function. Results in these domains and on high-dimensional benchmarks show that DMS outperforms CMA-MAE and other existing black-box QD algorithms.

One-sentence summary·Auto-generated by claude-haiku-4-5-20251001(?)

Proposes Discount Model Search for quality diversity optimization in high-dimensional measure spaces.

Contributions·Auto-generated by claude-haiku-4-5-20251001(?)
  • Uses smooth continuous model for discount values instead of histogram cells to handle high-dimensional measures
  • Enables quality diversity algorithms to distinguish solutions with similar measures in high-dimensional spaces
  • Introduces image-based measure specification via dataset rather than hand-designed functions
Methods used·Auto-generated by claude-haiku-4-5-20251001(?)
  • Quality diversity optimization
  • Discount value estimation
  • Black-box optimization
  • Neural models
Limitations (author-stated)·Auto-generated by claude-haiku-4-5-20251001(?)
  • Training discount model induces computational overhead
    from the paper
  • Alternative targets for discount model training may improve properties like smoothness
    from the paper
  • Primarily considers small MLPs for discount model architecture
    from the paper
Future work (author-stated)·Auto-generated by claude-haiku-4-5-20251001(?)
  • Explore domains requiring more advanced discount model architectures
    from the paper

Author keywords

  • quality diversity optimization
  • black-box optimization
  • derivative-free optimization
  • latent space exploration

Related orals

Something off? Let us know →