BMC Genomics (Aug 2012)

Pacific biosciences sequencing technology for genotyping and variation discovery in human data

  • Carneiro Mauricio O,
  • Russ Carsten,
  • Ross Michael G,
  • Gabriel Stacey B,
  • Nusbaum Chad,
  • DePristo Mark A

DOI
https://doi.org/10.1186/1471-2164-13-375
Journal volume & issue
Vol. 13, no. 1
p. 375

Abstract

Read online

Abstract Background Pacific Biosciences technology provides a fundamentally new data type that provides the potential to overcome some limitations of current next generation sequencing platforms by providing significantly longer reads, single molecule sequencing, low composition bias and an error profile that is orthogonal to other platforms. With these potential advantages in mind, we here evaluate the utility of the Pacific Biosciences RS platform for human medical amplicon resequencing projects. Results We evaluated the Pacific Biosciences technology for SNP discovery in medical resequencing projects using the Genome Analysis Toolkit, observing high sensitivity and specificity for calling differences in amplicons containing known true or false SNPs. We assessed data quality: most errors were indels (~14%) with few apparent miscalls (~1%). In this work, we define a custom data processing pipeline for Pacific Biosciences data for human data analysis. Conclusion Critically, the error properties were largely free of the context-specific effects that affect other sequencing technologies. These data show excellent utility for follow-up validation and extension studies in human data and medical genetics projects, but can be extended to other organisms with a reference genome.