Probabilistic Joint and Individual Variation Explained (ProJIVE) for Data Integration

Raphiel J. Murden et al.

Journal of Computational and Graphical Statistics2026https://doi.org/10.1080/10618600.2026.2639081article
AJG 3ABDC A*
Weight
0.50

Abstract

Collecting multiple types of data on the same set of subjects is common in modern scientific applications including, genomics, metabolomics, and neuroimaging. Joint and Individual Variance Explained (JIVE) seeks a low-rank approximation of the joint variation between two or more sets of features captured on common subjects and isolates this variation from that unique to eachset of features. We develop an expectation-maximization (EM) algorithm to estimate a probabilistic model for the JIVE framework. The model extends probabilistic principal components analysis to multiple data sets. Our maximum likelihood approach simultaneously estimates joint and individual components, which can lead to greater accuracy compared to other methods. We apply ProJIVE to measures of brain morphometry and cognition in Alzheimer's disease. ProJIVE learns biologically meaningful courses of variation, and the joint morphometry and cognition subject scores are strongly related to more expensive existing biomarkers. Data used in preparation of this article were obtained from the Alzheimer's Disease Neuroimaging Initiative (ADNI) database. Code to reproduce the analysis is available on our GitHub page.

Open via your library →

Cite this paper

https://doi.org/https://doi.org/10.1080/10618600.2026.2639081

Or copy a formatted citation

@article{raphiel2026,
  title        = {{Probabilistic Joint and Individual Variation Explained (ProJIVE) for Data Integration}},
  author       = {Raphiel J. Murden et al.},
  journal      = {Journal of Computational and Graphical Statistics},
  year         = {2026},
  doi          = {https://doi.org/https://doi.org/10.1080/10618600.2026.2639081},
}

Paste directly into BibTeX, Zotero, or your reference manager.

Flag this paper

Probabilistic Joint and Individual Variation Explained (ProJIVE) for Data Integration

Flags are reviewed by the Arbiter methodology team within 5 business days.


Evidence weight

0.50

Balanced mode · F 0.40 / M 0.15 / V 0.05 / R 0.40

F · citation impact0.50 × 0.4 = 0.20
M · momentum0.50 × 0.15 = 0.07
V · venue signal0.50 × 0.05 = 0.03
R · text relevance †0.50 × 0.4 = 0.20

† Text relevance is estimated at 0.50 on the detail page — for your query’s actual relevance score, open this paper from a search result.