BMC Evolutionary Biology (Dec 2001)

High correlation between the turnover of nucleotides under mutational pressure and the DNA composition

  • Dudkiewicz Malgorzata,
  • Nowicka Aleksandra,
  • Mackiewicz Dorota,
  • Mackiewicz Pawel,
  • Kowalczuk Maria,
  • Dudek Miroslaw R,
  • Cebrat Stanislaw

DOI
https://doi.org/10.1186/1471-2148-1-13
Journal volume & issue
Vol. 1, no. 1
p. 13

Abstract

Read online

Abstract Background Any DNA sequence is a result of compromise between the selection and mutation pressures exerted on it during evolution. It is difficult to estimate the relative influence of each of these pressures on the rate of accumulation of substitutions. However, it is important to discriminate between the effect of mutations, and the effect of selection, when studying the phylogenic relations between taxa. Results We have tested in computer simulations, and analytically, the available substitution matrices for many genomes, and we have found that DNA strands in equilibrium under mutational pressure have unique feature: the fraction of each type of nucleotide is linearly dependent on the time needed for substitution of half of nucleotides of a given type, with a correlation coefficient close to 1. Substitution matrices found for sequences under selection pressure do not have this property. A substitution matrix for the leading strand of the Borrelia burgdorferi genome, having reached equilibrium in computer simulation, gives a DNA sequence with nucleotide composition and asymmetry corresponding precisely to the third positions in codons of protein coding genes located on the leading strand. Conclusions Parameters of mutational pressure allow us to count DNA composition in equilibrium with this mutational pressure. Comparing any real DNA sequence with the sequence in equilibrium it is possible to estimate the distance between these sequences, which could be used as a measure of the selection pressure. Furthermore, the parameters of the mutational pressure enable direct estimation of the relative mutation rates in any DNA sequence in the studied genome.