Quantitative perturbation-based analysis of gene expression predicts enhancer activity in early Drosophila embryo
Rupinder Sayal,
Jacqueline M Dresch,
Irina Pushel,
Benjamin R Taylor,
David N Arnosti
Affiliations
Rupinder Sayal
Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, United States; Department of Biochemistry, DAV University, Jalandhar, India
Jacqueline M Dresch
Department of Mathematics, Michigan State University, East Lansing, United States; Department of Mathematics and Computer Science, Clark University, Worcester, United States
Irina Pushel
Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, United States; Stowers Institute for Medical Research, Kansas City, United States
Benjamin R Taylor
Department of Computer Science and Engineering, Michigan State University, East Lansing, United States; School of Computer Science, Georgia Institute of Technology, Atlanta, United States
Enhancers constitute one of the major components of regulatory machinery of metazoans. Although several genome-wide studies have focused on finding and locating enhancers in the genomes, the fundamental principles governing their internal architecture and cis-regulatory grammar remain elusive. Here, we describe an extensive, quantitative perturbation analysis targeting the dorsal-ventral patterning gene regulatory network (GRN) controlled by Drosophila NF-κB homolog Dorsal. To understand transcription factor interactions on enhancers, we employed an ensemble of mathematical models, testing effects of cooperativity, repression, and factor potency. Models trained on the dataset correctly predict activity of evolutionarily divergent regulatory regions, providing insights into spatial relationships between repressor and activator binding sites. Importantly, the collective predictions of sets of models were effective at novel enhancer identification and characterization. Our study demonstrates how experimental dataset and modeling can be effectively combined to provide quantitative insights into cis-regulatory information on a genome-wide scale.