PLoS Computational Biology (Aug 2021)

Discovering differential genome sequence activity with interpretable and efficient deep learning.

  • Jennifer Hammelman,
  • David K Gifford

DOI
https://doi.org/10.1371/journal.pcbi.1009282
Journal volume & issue
Vol. 17, no. 8
p. e1009282

Abstract

Read online

Discovering sequence features that differentially direct cells to alternate fates is key to understanding both cellular development and the consequences of disease related mutations. We introduce Expected Pattern Effect and Differential Expected Pattern Effect, two black-box methods that can interpret genome regulatory sequences for cell type-specific or condition specific patterns. We show that these methods identify relevant transcription factor motifs and spacings that are predictive of cell state-specific chromatin accessibility. Finally, we integrate these methods into framework that is readily accessible to non-experts and available for download as a binary or installed via PyPI or bioconda at https://cgs.csail.mit.edu/deepaccess-package/.