The synthetic instrument: from sparse association to sparse causation

Dingke Tang et al.

Journal of the Royal Statistical Society. Series B: Statistical Methodology2026https://doi.org/10.1093/jrsssb/qkaf083article
AJG 4
Weight
0.50

Abstract

In many observational studies, researchers are often interested in the effects of multiple exposures on a single outcome. Standard approaches for high-dimensional data, such as the Lasso, assume that the associations between the exposures and the outcome are sparse. However, these methods do not estimate causal effects in the presence of unmeasured confounding. In this paper, we consider an alternative approach that assumes the causal effects under consideration are sparse. We show that under sparse causation, causal effects are identifiable even with unmeasured confounding. Our proposal is built around a novel device called the synthetic instrument, which, in contrast to standard instrumental variables, can be constructed directly from the observed exposures. We demonstrate that, under the assumption of sparse causation, the problem of causal effect estimation can be formulated as an ℓ0-penalization problem and solved efficiently using off-the-shelf software. Simulations show that our approach outperforms state-of-the-art methods in both low- and high-dimensional settings. We further illustrate our method using a mouse obesity dataset.

Open via your library →

Cite this paper

https://doi.org/https://doi.org/10.1093/jrsssb/qkaf083

Or copy a formatted citation

@article{dingke2026,
  title        = {{The synthetic instrument: from sparse association to sparse causation}},
  author       = {Dingke Tang et al.},
  journal      = {Journal of the Royal Statistical Society. Series B: Statistical Methodology},
  year         = {2026},
  doi          = {https://doi.org/https://doi.org/10.1093/jrsssb/qkaf083},
}

Paste directly into BibTeX, Zotero, or your reference manager.

Flag this paper

The synthetic instrument: from sparse association to sparse causation

Flags are reviewed by the Arbiter methodology team within 5 business days.


Evidence weight

0.50

Balanced mode · F 0.40 / M 0.15 / V 0.05 / R 0.40

F · citation impact0.50 × 0.4 = 0.20
M · momentum0.50 × 0.15 = 0.07
V · venue signal0.50 × 0.05 = 0.03
R · text relevance †0.50 × 0.4 = 0.20

† Text relevance is estimated at 0.50 on the detail page — for your query’s actual relevance score, open this paper from a search result.