Nature Communications (Nov 2024)

ChromaFold predicts the 3D contact map from single-cell chromatin accessibility

  • Vianne R. Gao,
  • Rui Yang,
  • Arnav Das,
  • Renhe Luo,
  • Hanzhi Luo,
  • Dylan R. McNally,
  • Ioannis Karagiannidis,
  • Martin A. Rivas,
  • Zhong-Min Wang,
  • Darko Barisic,
  • Alireza Karbalayghareh,
  • Wilfred Wong,
  • Yingqian A. Zhan,
  • Christopher R. Chin,
  • William S. Noble,
  • Jeff A. Bilmes,
  • Effie Apostolou,
  • Michael G. Kharas,
  • Wendy Béguelin,
  • Aaron D. Viny,
  • Danwei Huangfu,
  • Alexander Y. Rudensky,
  • Ari M. Melnick,
  • Christina S. Leslie

DOI
https://doi.org/10.1038/s41467-024-53628-0
Journal volume & issue
Vol. 15, no. 1
pp. 1 – 15

Abstract

Read online

Abstract Identifying cell-type-specific 3D chromatin interactions between regulatory elements can help decipher gene regulation and interpret disease-associated non-coding variants. However, achieving this resolution with current 3D genomics technologies is often infeasible given limited input cell numbers. We therefore present ChromaFold, a deep learning model that predicts 3D contact maps, including regulatory interactions, from single-cell ATAC sequencing (scATAC-seq) data alone. ChromaFold uses pseudobulk chromatin accessibility, co-accessibility across metacells, and a CTCF motif track as inputs and employs a lightweight architecture to train on standard GPUs. Trained on paired scATAC-seq and Hi-C data in human samples, ChromaFold accurately predicts the 3D contact map and peak-level interactions across diverse human and mouse test cell types. Compared to leading contact map prediction models that use ATAC-seq and CTCF ChIP-seq, ChromaFold achieves state-of-the-art performance using only scATAC-seq. Finally, fine-tuning ChromaFold on paired scATAC-seq and Hi-C in a complex tissue enables deconvolution of chromatin interactions across cell subpopulations.