Addressing challenges in the production and analysis of illumina sequencing data

Kelso Janet; Heyn Patricia; Kircher Martin

doi:10.1186/1471-2164-12-382

BMC Genomics (Jul 2011)

Addressing challenges in the production and analysis of illumina sequencing data

Kelso Janet,
Heyn Patricia,
Kircher Martin

Affiliations

Kelso Janet
Heyn Patricia
Kircher Martin

DOI: https://doi.org/10.1186/1471-2164-12-382
Journal volume & issue: Vol. 12, no. 1
p. 382

Abstract

Read online

Abstract Advances in DNA sequencing technologies have made it possible to generate large amounts of sequence data very rapidly and at substantially lower cost than capillary sequencing. These new technologies have specific characteristics and limitations that require either consideration during project design, or which must be addressed during data analysis. Specialist skills, both at the laboratory and the computational stages of project design and analysis, are crucial to the generation of high quality data from these new platforms. The Illumina sequencers (including the Genome Analyzers I/II/IIe/IIx and the new HiScan and HiSeq) represent a widely used platform providing parallel readout of several hundred million immobilized sequences using fluorescent-dye reversible-terminator chemistry. Sequencing library quality, sample handling, instrument settings and sequencing chemistry have a strong impact on sequencing run quality. The presence of adapter chimeras and adapter sequences at the end of short-insert molecules, as well as increased error rates and short read lengths complicate many computational analyses. We discuss here some of the factors that influence the frequency and severity of these problems and provide solutions for circumventing these. Further, we present a set of general principles for good analysis practice that enable problems with sequencing runs to be identified and dealt with.

Published in BMC Genomics

ISSN: 1471-2164 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Technology: Chemical technology: Biotechnology; Science: Biology (General): Genetics
Website: http://bmcgenomics.biomedcentral.com

About the journal