Cell Reports (Dec 2019)

Quantification of Differential Transcription Factor Activity and Multiomics-Based Classification into Activators and Repressors: diffTF

  • Ivan Berest,
  • Christian Arnold,
  • Armando Reyes-Palomares,
  • Giovanni Palla,
  • Kasper Dindler Rasmussen,
  • Holly Giles,
  • Peter-Martin Bruch,
  • Wolfgang Huber,
  • Sascha Dietrich,
  • Kristian Helin,
  • Judith B. Zaugg

Journal volume & issue
Vol. 29, no. 10
pp. 3147 – 3159.e12

Abstract

Read online

Summary: Transcription factors (TFs) regulate many cellular processes and can therefore serve as readouts of the signaling and regulatory state. Yet for many TFs, the mode of action—repressing or activating transcription of target genes—is unclear. Here, we present diffTF (https://git.embl.de/grp-zaugg/diffTF) to calculate differential TF activity (basic mode) and classify TFs into putative transcriptional activators or repressors (classification mode). In basic mode, it combines genome-wide chromatin accessibility/activity with putative TF binding sites that, in classification mode, are integrated with RNA-seq. We apply diffTF to compare (1) mutated and unmutated chronic lymphocytic leukemia patients and (2) two hematopoietic progenitor cell types. In both datasets, diffTF recovers most known biology and finds many previously unreported TFs. It classifies almost 40% of TFs based on their mode of action, which we validate experimentally. Overall, we demonstrate that diffTF recovers known biology, identifies less well-characterized TFs, and classifies TFs into transcriptional activators or repressors. : Berest et al. present a computational tool (diffTF) to estimate differential TF activity and classify TFs into activators or repressors. It requires active chromatin data (accessibility/ChIP-seq) and integrates with RNA-seq for classification. The authors apply it to two case studies (CLL and hematopoietic differentiation) and validate their predictions experimentally. Keywords: transcription factor, ATAC-seq, CLL, transcriptional activator and repressor, RNA-seq, TF footprint, open chromatin, snakemake, multiomics data integration