Systematic analysis of naturally occurring insertions and deletions that alter transcription factor spacing identifies tolerant and sensitive transcription factor pairs
Department of Cellular and Molecular Medicine, School of Medicine, University of California San Diego, La Jolla, United States; Department of Bioengineering, Jacobs School of Engineering, University of California San Diego, La Jolla, United States
Department of Cellular and Molecular Medicine, School of Medicine, University of California San Diego, La Jolla, United States; Department of Medical Biochemistry, Experimental Vascular Biology, Amsterdam Infection and Immunity, Amsterdam Cardiovascular Sciences, Amsterdam UMC, University of Amsterdam, Amsterdam, Netherlands
Nathan J Spann
Department of Cellular and Molecular Medicine, School of Medicine, University of California San Diego, La Jolla, United States
Jenhan Tao
Department of Cellular and Molecular Medicine, School of Medicine, University of California San Diego, La Jolla, United States
Gregory J Fonseca
Department of Cellular and Molecular Medicine, School of Medicine, University of California San Diego, La Jolla, United States; Department of Medicine, McGill University, Montreal, Canada
Thomas Le
Division of Biological Sciences, University of California San Diego, La Jolla, United States
Lindsey K Stolze
Department of Cellular and Molecular Medicine, College of Medicine, University of Arizona, Tucson, United States
Department of Cellular and Molecular Medicine, School of Medicine, University of California San Diego, La Jolla, United States; Department of Biochemistry and Molecular Biology, Nippon Medical School, Tokyo, Japan
Department of Cellular and Molecular Medicine, School of Medicine, University of California San Diego, La Jolla, United States; Department of Medicine, School of Medicine, University of California San Diego, La Jolla, United States
Regulation of gene expression requires the combinatorial binding of sequence-specific transcription factors (TFs) at promoters and enhancers. Prior studies showed that alterations in the spacing between TF binding sites can influence promoter and enhancer activity. However, the relative importance of TF spacing alterations resulting from naturally occurring insertions and deletions (InDels) has not been systematically analyzed. To address this question, we first characterized the genome-wide spacing relationships of 73 TFs in human K562 cells as determined by ChIP-seq (chromatin immunoprecipitation sequencing). We found a dominant pattern of a relaxed range of spacing between collaborative factors, including 45 TFs exclusively exhibiting relaxed spacing with their binding partners. Next, we exploited millions of InDels provided by genetically diverse mouse strains and human individuals to investigate the effects of altered spacing on TF binding and local histone acetylation. These analyses suggested that spacing alterations resulting from naturally occurring InDels are generally tolerated in comparison to genetic variants directly affecting TF binding sites. To experimentally validate this prediction, we introduced synthetic spacing alterations between PU.1 and C/EBPβ binding sites at six endogenous genomic loci in a macrophage cell line. Remarkably, collaborative binding of PU.1 and C/EBPβ at these locations tolerated changes in spacing ranging from 5 bp increase to >30 bp decrease. Collectively, these findings have implications for understanding mechanisms underlying enhancer selection and for the interpretation of non-coding genetic variation.