PLoS Genetics (Apr 2018)

Degenerate Pax2 and Senseless binding motifs improve detection of low-affinity sites required for enhancer specificity.

  • Arya Zandvakili,
  • Ian Campbell,
  • Lisa M Gutzwiller,
  • Matthew T Weirauch,
  • Brian Gebelein

DOI
https://doi.org/10.1371/journal.pgen.1007289
Journal volume & issue
Vol. 14, no. 4
p. e1007289

Abstract

Read online

Cells use thousands of regulatory sequences to recruit transcription factors (TFs) and produce specific transcriptional outcomes. Since TFs bind degenerate DNA sequences, discriminating functional TF binding sites (TFBSs) from background sequences represents a significant challenge. Here, we show that a Drosophila regulatory element that activates Epidermal Growth Factor signaling requires overlapping, low-affinity TFBSs for competing TFs (Pax2 and Senseless) to ensure cell- and segment-specific activity. Testing available TF binding models for Pax2 and Senseless, however, revealed variable accuracy in predicting such low-affinity TFBSs. To better define parameters that increase accuracy, we developed a method that systematically selects subsets of TFBSs based on predicted affinity to generate hundreds of position-weight matrices (PWMs). Counterintuitively, we found that degenerate PWMs produced from datasets depleted of high-affinity sequences were more accurate in identifying both low- and high-affinity TFBSs for the Pax2 and Senseless TFs. Taken together, these findings reveal how TFBS arrangement can be constrained by competition rather than cooperativity and that degenerate models of TF binding preferences can improve identification of biologically relevant low affinity TFBSs.