Genome Biology (Aug 2022)

BindVAE: Dirichlet variational autoencoders for de novo motif discovery from accessible chromatin

  • Meghana Kshirsagar,
  • Han Yuan,
  • Juan Lavista Ferres,
  • Christina Leslie

DOI
https://doi.org/10.1186/s13059-022-02723-w
Journal volume & issue
Vol. 23, no. 1
pp. 1 – 24

Abstract

Read online

Abstract We present a novel unsupervised deep learning approach called BindVAE, based on Dirichlet variational autoencoders, for jointly decoding multiple TF binding signals from open chromatin regions. BindVAE can disentangle an input DNA sequence into distinct latent factors that encode cell-type specific in vivo binding signals for individual TFs, composite patterns for TFs involved in cooperative binding, and genomic context surrounding the binding sites. On the task of retrieving the motifs of expressed TFs in a given cell type, BindVAE is competitive with existing motif discovery approaches.

Keywords