Entropy (Jun 2020)

Information Theory in Computational Biology: Where We Stand Today

  • Pritam Chanda,
  • Eduardo Costa,
  • Jie Hu,
  • Shravan Sukumar,
  • John Van Hemert,
  • Rasna Walia

DOI
https://doi.org/10.3390/e22060627
Journal volume & issue
Vol. 22, no. 6
p. 627

Abstract

Read online

“A Mathematical Theory of Communication” was published in 1948 by Claude Shannon to address the problems in the field of data compression and communication over (noisy) communication channels. Since then, the concepts and ideas developed in Shannon’s work have formed the basis of information theory, a cornerstone of statistical learning and inference, and has been playing a key role in disciplines such as physics and thermodynamics, probability and statistics, computational sciences and biological sciences. In this article we review the basic information theory based concepts and describe their key applications in multiple major areas of research in computational biology—gene expression and transcriptomics, alignment-free sequence comparison, sequencing and error correction, genome-wide disease-gene association mapping, metabolic networks and metabolomics, and protein sequence, structure and interaction analysis.

Keywords