Quantification of Differential Transcription Factor Activity and Multiomics-Based Classification into Activators and Repressors: diffTF
Ivan Berest,
Christian Arnold,
Armando Reyes-Palomares,
Giovanni Palla,
Kasper Dindler Rasmussen,
Holly Giles,
Peter-Martin Bruch,
Wolfgang Huber,
Sascha Dietrich,
Kristian Helin,
Judith B. Zaugg
Affiliations
Ivan Berest
Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany; Collaboration for joint PhD degree between EMBL and Heidelberg University, Faculty of Biosciences, Heidelberg, Germany
Christian Arnold
Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany
Armando Reyes-Palomares
Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany
Giovanni Palla
Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany
Kasper Dindler Rasmussen
School of Life Sciences, University of Dundee, Dundee, UK; Biotech Research and Innovation Centre (BRIC), University of Copenhagen, Copenhagen, Denmark
Holly Giles
Heidelberg University Hospital, Heidelberg, Germany; Genome Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany; Collaboration for joint PhD degree between EMBL and Heidelberg University, Faculty of Biosciences, Heidelberg, Germany
Peter-Martin Bruch
Heidelberg University Hospital, Heidelberg, Germany
Wolfgang Huber
Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany; Genome Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany; European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
Sascha Dietrich
Heidelberg University Hospital, Heidelberg, Germany
Kristian Helin
Biotech Research and Innovation Centre (BRIC), University of Copenhagen, Copenhagen, Denmark; Novo Nordisk Foundation Center for Stem Cell Biology, Copenhagen, Denmark; Cell Biology Program and Center for Epigenetics Research, Memorial Sloan Kettering Cancer Center, New York, NY, USA
Judith B. Zaugg
Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany; Genome Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany; European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK; Corresponding author
Summary: Transcription factors (TFs) regulate many cellular processes and can therefore serve as readouts of the signaling and regulatory state. Yet for many TFs, the mode of action—repressing or activating transcription of target genes—is unclear. Here, we present diffTF (https://git.embl.de/grp-zaugg/diffTF) to calculate differential TF activity (basic mode) and classify TFs into putative transcriptional activators or repressors (classification mode). In basic mode, it combines genome-wide chromatin accessibility/activity with putative TF binding sites that, in classification mode, are integrated with RNA-seq. We apply diffTF to compare (1) mutated and unmutated chronic lymphocytic leukemia patients and (2) two hematopoietic progenitor cell types. In both datasets, diffTF recovers most known biology and finds many previously unreported TFs. It classifies almost 40% of TFs based on their mode of action, which we validate experimentally. Overall, we demonstrate that diffTF recovers known biology, identifies less well-characterized TFs, and classifies TFs into transcriptional activators or repressors. : Berest et al. present a computational tool (diffTF) to estimate differential TF activity and classify TFs into activators or repressors. It requires active chromatin data (accessibility/ChIP-seq) and integrates with RNA-seq for classification. The authors apply it to two case studies (CLL and hematopoietic differentiation) and validate their predictions experimentally. Keywords: transcription factor, ATAC-seq, CLL, transcriptional activator and repressor, RNA-seq, TF footprint, open chromatin, snakemake, multiomics data integration