F1000Research (Sep 2018)

The ICR142 NGS validation series: a resource for orthogonal assessment of NGS analysis [version 2; referees: 2 approved]

  • Elise Ruark,
  • Anthony Renwick,
  • Matthew Clarke,
  • Katie Snape,
  • Emma Ramsay,
  • Anna Elliott,
  • Sandra Hanks,
  • Ann Strydom,
  • Sheila Seal,
  • Nazneen Rahman

DOI
https://doi.org/10.12688/f1000research.8219.2
Journal volume & issue
Vol. 5

Abstract

Read online

To provide a useful community resource for orthogonal assessment of NGS analysis software, we present the ICR142 NGS validation series. The dataset includes high-quality exome sequence data from 142 samples together with Sanger sequence data at 704 sites; 416 sites with variants and 288 sites at which variants were called by an NGS analysis tool, but no variant is present in the corresponding Sanger sequence. The dataset includes 293 indel variants and 247 negative indel sites, and thus the ICR142 validation dataset is of particular utility in evaluating indel calling performance. The FASTQ files and Sanger sequence results can be accessed in the European Genome-phenome Archive under the accession number EGAS00001001332.

Keywords