Life (Mar 2022)

Why Is the UAG (Amber) Stop Codon Almost Absent in Highly Expressed Bacterial Genes?

  • Dominique Belin,
  • Pere Puigbò

DOI
https://doi.org/10.3390/life12030431
Journal volume & issue
Vol. 12, no. 3
p. 431

Abstract

Read online

The genome hypothesis postulates that genes in a genome tend to conform to their species’ usage of the codon catalog and the GC content of the DNA. Thus, codon frequencies differ across organisms, including the three termination codons in the standard genetic code. Here, we analyze the frequencies of stop codons in a group of highly expressed genes from 196 prokaryotes under strong translational selection. The occurrence of the three translation termination codons is highly biased, with UAA (ochre) being the most prevalent in almost all bacteria. In contrast, UAG (amber) is the least frequent termination codon, e.g., only 321 occurrences (7.4%) in E. coli K-12 substr. W3110. Of the 253 highly expressed genes, only two end with an UAG codon. The strength of the selective bias against UAG in highly expressed genes varies among bacterial genomes, but it is not affected by the GC content of these genomes. In contrast, increased GC content results in a decrease in UAA abundance with a concomitant increase in UGA abundance. We propose that readthrough efficiency and context effects could explain the prevalence of UAA over UAG, particularly in highly expressed genes. Findings from this communication can be utilized for the optimization of gene expression.

Keywords