BMC Bioinformatics (Jul 2005)

Information theory-based algorithm for <it>in silico </it>prediction of PCR products with whole genomic sequences as templates

  • He Junjian,
  • Wei Guifang,
  • Zhang Yulei,
  • Kou Chunhai,
  • Xu Kexue,
  • Wang Lianjie,
  • Cao Youfang,
  • Wang Yunfang,
  • Zhao Liping

DOI
https://doi.org/10.1186/1471-2105-6-190
Journal volume & issue
Vol. 6, no. 1
p. 190

Abstract

Read online

Abstract Background A new algorithm for assessing similarity between primer and template has been developed based on the hypothesis that annealing of primer to template is an information transfer process. Results Primer sequence is converted to a vector of the full potential hydrogen numbers (3 for G or C, 2 for A or T), while template sequence is converted to a vector of the actual hydrogen bond numbers formed after primer annealing. The former is considered as source information and the latter destination information. An information coefficient is calculated as a measure for fidelity of this information transfer process and thus a measure of similarity between primer and potential annealing site on template. Conclusion Successful prediction of PCR products from whole genomic sequences with a computer program based on the algorithm demonstrated the potential of this new algorithm in areas like in silico PCR and gene finding.