Genome Biology (Jun 2024)

CelFiE-ISH: a probabilistic model for multi-cell type deconvolution from single-molecule DNA methylation haplotypes

  • Irene Unterman,
  • Dana Avrahami,
  • Efrat Katsman,
  • Triche Tim Jr.,
  • Benjamin Glaser,
  • Benjamin P. Berman

DOI
https://doi.org/10.1186/s13059-024-03275-x
Journal volume & issue
Vol. 25, no. 1
pp. 1 – 27

Abstract

Read online

Abstract Deconvolution methods infer quantitative cell type estimates from bulk measurement of mixed samples including blood and tissue. DNA methylation sequencing measures multiple CpGs per read, but few existing deconvolution methods leverage this within-read information. We develop CelFiE-ISH, which extends an existing method (CelFiE) to use within-read haplotype information. CelFiE-ISH outperforms CelFiE and other existing methods, achieving 30% better accuracy and more sensitive detection of rare cell types. We also demonstrate the importance of marker selection and of tailoring markers for haplotype-aware methods. While here we use gold-standard short-read sequencing data, haplotype-aware methods will be well-suited for long-read sequencing.

Keywords