BMC Bioinformatics (Mar 2005)

GeneKeyDB: A lightweight, gene-centric, relational database to support data mining environments

  • Zhang B,
  • Schmoyer D,
  • Baker E,
  • Peng X,
  • Kirov SA,
  • Snoddy J

DOI
https://doi.org/10.1186/1471-2105-6-72
Journal volume & issue
Vol. 6, no. 1
p. 72

Abstract

Read online

Abstract Background The analysis of biological data is greatly enhanced by existing or emerging databases. Most existing databases, with few exceptions are not designed to easily support large scale computational analysis, but rather offer exclusively a web interface to the resource. We have recognized the growing need for a database which can be used successfully as a backend to computational analysis tools and pipelines. Such database should be sufficiently versatile to allow easy system integration. Results GeneKeyDB is a gene-centered relational database developed to enhance data mining in biological data sets. The system provides an underlying data layer for computational analysis tools and visualization tools. GeneKeyDB relies primarily on existing database identifiers derived from community databases (NCBI, GO, Ensembl, et al.) as well as the known relationships among those identifiers. It is a lightweight, portable, and extensible platform for integration with computational tools and analysis environments. Conclusion GeneKeyDB can enable analysis tools and users to manipulate the intersections, unions, and differences among different data sets.