Nature Communications (Feb 2025)

INSTINCT: Multi-sample integration of spatial chromatin accessibility sequencing data via stochastic domain translation

  • Yuyao Liu,
  • Zhen Li,
  • Xiaoyang Chen,
  • Xuejian Cui,
  • Zijing Gao,
  • Rui Jiang

DOI
https://doi.org/10.1038/s41467-025-56535-0
Journal volume & issue
Vol. 16, no. 1
pp. 1 – 23

Abstract

Read online

Abstract Recent advances in spatial epigenomic techniques have given rise to spatial assay for transposase-accessible chromatin using sequencing (spATAC-seq) data, enabling the characterization of epigenomic heterogeneity and spatial information simultaneously. Integrative analysis of multiple spATAC-seq samples, for which no method has been developed, allows for effective identification and elimination of unwanted non-biological factors within the data, enabling comprehensive exploration of tissue structures and providing a holistic epigenomic landscape, thereby facilitating the discovery of biological implications and the study of regulatory processes. In this article, we present INSTINCT, a method for multi-sample INtegration of Spatial chromaTIN accessibility sequencing data via stochastiC domain Translation. INSTINCT can efficiently handle the high dimensionality of spATAC-seq data and eliminate the complex noise and batch effects of samples through a stochastic domain translation procedure. We demonstrate the superiority and robustness of INSTINCT in integrating spATAC-seq data across multiple simulated scenarios and real datasets. Additionally, we highlight the advantages of INSTINCT in spatial domain identification, visualization, spot-type annotation, and various downstream analyses, including motif enrichment analysis, expression enrichment analysis, and partitioned heritability analysis.