BMC Biology (Aug 2023)

Emergence and influence of sequence bias in evolutionarily malleable, mammalian tandem arrays

  • Margarita V. Brovkina,
  • Margaret A. Chapman,
  • Matthew L. Holding,
  • E. Josephine Clowney

DOI
https://doi.org/10.1186/s12915-023-01673-4
Journal volume & issue
Vol. 21, no. 1
pp. 1 – 28

Abstract

Read online

Abstract Background The radiation of mammals at the extinction of the dinosaurs produced a plethora of new forms—as diverse as bats, dolphins, and elephants—in only 10–20 million years. Behind the scenes, adaptation to new niches is accompanied by extensive innovation in large families of genes that allow animals to contact the environment, including chemosensors, xenobiotic enzymes, and immune and barrier proteins. Genes in these “outward-looking” families are allelically diverse among humans and exhibit tissue-specific and sometimes stochastic expression. Results Here, we show that these tandem arrays of outward-looking genes occupy AT-biased isochores and comprise the “tissue-specific” gene class that lack CpG islands in their promoters. Models of mammalian genome evolution have not incorporated the sharply different functions and transcriptional patterns of genes in AT- versus GC-biased regions. To examine the relationship between gene family expansion, sequence content, and allelic diversity, we use population genetic data and comparative analysis. First, we find that AT bias can emerge during evolutionary expansion of gene families in cis. Second, human genes in AT-biased isochores or with GC-poor promoters experience relatively low rates of de novo point mutation today but are enriched for non-synonymous variants. Finally, we find that isochores containing gene clusters exhibit low rates of recombination. Conclusions Our analyses suggest that tolerance of non-synonymous variation and low recombination are two forces that have produced the depletion of GC bases in outward-facing gene arrays. In turn, high AT content exerts a profound effect on their chromatin organization and transcriptional regulation.

Keywords