Ensembl variation resources

Marin-Garcia Pablo; Kulesha Eugene; Brent Simon; Spudich Giulietta M; Pritchard Bethan; Smith James; McLaren William M; Rios Daniel; Cunningham Fiona; Chen Yuan; Smedley Damian; Birney Ewan; Flicek Paul

doi:10.1186/1471-2164-11-293

BMC Genomics (May 2010)

Ensembl variation resources

Marin-Garcia Pablo,
Kulesha Eugene,
Brent Simon,
Spudich Giulietta M,
Pritchard Bethan,
Smith James,
McLaren William M,
Rios Daniel,
Cunningham Fiona,
Chen Yuan,
Smedley Damian,
Birney Ewan,
Flicek Paul

Affiliations

Marin-Garcia Pablo
Kulesha Eugene
Brent Simon
Spudich Giulietta M
Pritchard Bethan
Smith James
McLaren William M
Rios Daniel
Cunningham Fiona
Chen Yuan
Smedley Damian
Birney Ewan
Flicek Paul

DOI: https://doi.org/10.1186/1471-2164-11-293
Journal volume & issue: Vol. 11, no. 1
p. 293

Abstract

Read online

Abstract Background The maturing field of genomics is rapidly increasing the number of sequenced genomes and producing more information from those previously sequenced. Much of this additional information is variation data derived from sampling multiple individuals of a given species with the goal of discovering new variants and characterising the population frequencies of the variants that are already known. These data have immense value for many studies, including those designed to understand evolution and connect genotype to phenotype. Maximising the utility of the data requires that it be stored in an accessible manner that facilitates the integration of variation data with other genome resources such as gene annotation and comparative genomics. Description The Ensembl project provides comprehensive and integrated variation resources for a wide variety of chordate genomes. This paper provides a detailed description of the sources of data and the methods for creating the Ensembl variation databases. It also explores the utility of the information by explaining the range of query options available, from using interactive web displays, to online data mining tools and connecting directly to the data servers programmatically. It gives a good overview of the variation resources and future plans for expanding the variation data within Ensembl. Conclusions Variation data is an important key to understanding the functional and phenotypic differences between individuals. The development of new sequencing and genotyping technologies is greatly increasing the amount of variation data known for almost all genomes. The Ensembl variation resources are integrated into the Ensembl genome browser and provide a comprehensive way to access this data in the context of a widely used genome bioinformatics system. All Ensembl data is freely available at http://www.ensembl.org and from the public MySQL database server at ensembldb.ensembl.org.

Published in BMC Genomics

ISSN: 1471-2164 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Technology: Chemical technology: Biotechnology; Science: Biology (General): Genetics
Website: http://bmcgenomics.biomedcentral.com

About the journal