TabStruct: Measuring Structural Fidelity of Tabular Data

Xiangjian Jiang, Nikola Simidjievski, Mateja Jamnik

Datasets, Benchmarks & Evaluation Thu, Apr 23 · 11:18 AM–11:28 AM · 202 A/B Avg rating: 7.00 (4–10)

Author-provided TL;DR

We propose TabStruct, a comprehensive benchmark, along with a novel metric, global utility, for evaluating the structural fidelity of tabular data without requiring access to ground-truth causal structures.

Abstract

Evaluating tabular generators remains a challenging problem, as the unique causal structural prior of heterogeneous tabular data does not lend itself to intuitive human inspection. Recent work has introduced structural fidelity as a tabular-specific evaluation dimension to assess whether synthetic data complies with the causal structures of real data. However, existing benchmarks often neglect the interplay between structural fidelity and conventional evaluation dimensions, thus failing to provide a holistic understanding of model performance. Moreover, they are typically limited to toy datasets, as quantifying existing structural fidelity metrics requires access to ground-truth causal structures, which are rarely available for real-world datasets. In this paper, we propose a novel evaluation framework that jointly considers structural fidelity and conventional evaluation dimensions. We introduce a new evaluation metric, global utility, which enables the assessment of structural fidelity even in the absence of ground-truth causal structures. In addition, we present TabStruct, a comprehensive evaluation benchmark offering large-scale quantitative analysis on 13 tabular generators from nine distinct categories, across 29 datasets. Our results demonstrate that global utility provides a task-independent, domain-agnostic lens for tabular generator performance. We release the TabStruct benchmark suite, including all datasets, evaluation pipelines, and raw results. Code is available at https://github.com/SilenceX12138/TabStruct.

One-sentence summary·Auto-generated by claude-haiku-4-5-20251001(?)

TabStruct benchmark evaluates tabular data generators on structural fidelity and conventional dimensions using global utility metric without ground-truth causal structures.

Contributions·Auto-generated by claude-haiku-4-5-20251001(?)

Global utility metric enabling structural fidelity assessment without ground-truth causal structures
Comprehensive benchmark with 13 generators across 29 datasets and 9 distinct categories
Finding that existing evaluation methods favor local causal interactions while neglecting global structure

Methods used·Auto-generated by claude-haiku-4-5-20251001(?)

structural fidelity evaluation
global utility metric
causal structure assessment

Datasets used·Auto-generated by claude-haiku-4-5-20251001(?)

29 tabular datasets

Limitations (author-stated)·Auto-generated by claude-haiku-4-5-20251001(?)

Authors did not state explicit limitations.

Future work (author-stated)·Auto-generated by claude-haiku-4-5-20251001(?)

Add additional datasets, generators, and evaluation metrics through community engagement
from the paper

Author keywords

Tabular data
Tabular data structure
Synthetic data generation

Something off? Let us know →

TabStruct: Measuring Structural Fidelity of Tabular Data

Abstract

Author keywords

Related orals

On the Wasserstein Geodesic Principal Component Analysis of probability measures

Monocular Normal Estimation via Shading Sequence Estimation

TTSDS2: Resources and Benchmark for Evaluating Human-Quality Text to Speech Systems

World-In-World: World Models in a Closed-Loop World

EditBench: Evaluating LLM Abilities to Perform Real-World Instructed Code Edits