PLoS Biology (May 2010)

A large fraction of extragenic RNA pol II transcription sites overlap enhancers.

  • Francesca De Santa,
  • Iros Barozzi,
  • Flore Mietton,
  • Serena Ghisletti,
  • Sara Polletti,
  • Betsabeh Khoramian Tusi,
  • Heiko Muller,
  • Jiannis Ragoussis,
  • Chia-Lin Wei,
  • Gioacchino Natoli

DOI
https://doi.org/10.1371/journal.pbio.1000384
Journal volume & issue
Vol. 8, no. 5
p. e1000384

Abstract

Read online

Mammalian genomes are pervasively transcribed outside mapped protein-coding genes. One class of extragenic transcription products is represented by long non-coding RNAs (lncRNAs), some of which result from Pol_II transcription of bona-fide RNA genes. Whether all lncRNAs described insofar are products of RNA genes, however, is still unclear. Here we have characterized transcription sites located outside protein-coding genes in a highly regulated response, macrophage activation by endotoxin. Using chromatin signatures, we could unambiguously classify extragenic Pol_II binding sites as belonging to either canonical RNA genes or transcribed enhancers. Unexpectedly, 70% of extragenic Pol_II peaks were associated with genomic regions with a canonical chromatin signature of enhancers. Enhancer-associated extragenic transcription was frequently adjacent to inducible inflammatory genes, was regulated in response to endotoxin stimulation, and generated very low abundance transcripts. Moreover, transcribed enhancers were under purifying selection and contained binding sites for inflammatory transcription factors, thus suggesting their functionality. These data demonstrate that a large fraction of extragenic Pol_II transcription sites can be ascribed to cis-regulatory genomic regions. Discrimination between lncRNAs generated by canonical RNA genes and products of transcribed enhancers will provide a framework for experimental approaches to lncRNAs and help complete the annotation of mammalian genomes.