What is the impact of scaling the number of pre-training causal models on the fairness-accuracy trade-off in T

Assignee Research

SRCH:B7D73616

Submitted: 10 June 2026
Review score: 6.33/10
Verification: L1, Literature synthesis
Gate status: Unverified
Quality tier: Watchlist

Abstract

Training machine learning models for fair decisions faces two key challenges. The first is the fairness-accuracy trade-off: enforcing fairness weakens a model's predictive performance relative to an unconstrained model. The second is the incompatibility of different fairness metrics, which poses another trade-off, also known as the impossibility theorem. Recent work identifies the bias within the observed data as a possible root cause and shows that fairness and predictive performance are in fact in accord when predictive performance is measured on unbiased data. We offer a causal explanation for

Research Question

What is the impact of scaling the number of pre-training causal models on the fairness-accuracy trade-off in TabPFN, evaluated using disparity metrics and accuracy scores on biased tabular benchmarks?
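
To make the two evaluation quantities in the research question concrete, the following is a minimal sketch of an accuracy score and one possible disparity metric. The record does not say which disparity metrics are meant, so the demographic parity difference used here is an assumption, as are the function names and the toy data. The sketch does not call TabPFN; `y_pred` stands in for the predictions of any classifier.

```python
from typing import Sequence


def accuracy(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Fraction of predictions that match the labels."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and equally long")
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def demographic_parity_difference(
    y_pred: Sequence[int], group: Sequence[int]
) -> float:
    """Absolute gap in positive-prediction rate between groups 0 and 1.

    Assumed metric: the source only says "disparity metrics".
    """
    rates = []
    for g in (0, 1):
        preds = [p for p, a in zip(y_pred, group) if a == g]
        if not preds:
            raise ValueError(f"group {g} has no samples")
        rates.append(sum(preds) / len(preds))
    return abs(rates[0] - rates[1])


# Toy data invented for illustration only; not taken from any benchmark.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 1, 1, 0, 0, 0, 1, 0]  # hypothetical classifier output
group = [0, 0, 0, 0, 1, 1, 1, 1]   # binary protected attribute

acc = accuracy(y_true, y_pred)
dpd = demographic_parity_difference(y_pred, group)
print(f"accuracy={acc:.3f}  demographic_parity_difference={dpd:.3f}")
```

Under these assumptions, a scaling study would report one (accuracy, disparity) pair per pre-training configuration. A lower disparity at equal or higher accuracy would indicate a more favorable trade-off.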

Verification Level

Paper level	L1, Literature synthesis
Source-grounded claims	0
Claim record source	not publicly specified

Descriptive public verification status only; aggregate claim counts are public, but individual claim records are not exposed here.

Truth-Engine Gate Verdict

Status	Unverified
Gate	Gate 2 — Verification (formal proof or sandbox reproduction)
Reason	Published before the Gate 2 verification pipeline was activated (2026-06-10). No formal proof or sandbox reproduction has been attempted for this record.
Evaluated	2026-06-10T06:30:49+00:00

This record has not completed Gate 2 of the verification pipeline (a type-checked Lean4 proof for mathematical claims, or a sealed-sandbox reproduction for empirical claims). It is a literature synthesis only. The VERIFIED status can be set only once a reproducible artifact (Lean4 proof source, or repro script and results) is attached; it is not derived from review score or claim count.

Quality Tier

Tier	Watchlist
Basis	Review score or public verified-claim signal is below DOI-grade threshold.

Descriptive public triage only; this tier does not alter current publication or DOI behavior.

Quality Dimensions

Evidence strength	LOW
Uncertainty disclosure	MEDIUM
Reproducibility status	MEDIUM

Automated triage signals derived from public fields; not human peer review or independent validation.

Correction Record

Status	CURRENT
Correction count	0
Manifest contract	paper-manifest-v1.1
Correction contract	correction-record-v1

Public corrections are additive records. Current status does not claim the synthesis is error-free.

Provenance

Publisher	Assignee Research
Public provenance	L2, Public artifact record
Report artifact	Available
External record	Not registered
Claim lineage	0 aggregate source-grounded claims
Review method	Automated multi-reviewer assessment
Quality guide	How to read scores, claims, manifests, and evidence links
Provenance contract	source-provenance-v1
Note	Machine-generated synthesis of existing literature. Not primary research.