BMC Genomics (Mar 2019)
Universal human papillomavirus typing by whole genome sequencing following target enrichment: evaluation of assay reproducibility and limit of detection
Abstract
Abstract Background We recently described a method for unbiased detection of all known human papillomaviruses (HPV) types with the potential for the determination of their variant and integration from the resulting whole genome sequence data. Considering the complex workflow for target-enriched next generation sequencing (NGS), we focused on the reproducibility and limit of detection (LOD) of this new universal HPV typing assay in this study. Results We evaluated the reproducibility and LOD for HPV genotyping based on our recently published method that used RNA-baits targeting whole genomes of 191 HPV types, Agilent SureSelect protocol for target enrichment and Illumina HiSeq 2500 for sequencing (eWGS, enriched whole genome sequencing). Two libraries, prepared from pooled plasmids representing 9 vaccine HPV types at varying input (1–625 copies/reaction), were sequenced twice giving four replicates for evaluating reproducibility and LOD. eWGS showed high correlation in the number of reads mapped to HPV reference genomes between the two flow-cell lanes within (R2 = 1) and between experiments (R2 = 0.99). The number of mapped reads was positively correlated to copy number (β = 13.9, p < 0.0001). The limit of blank (LOB) could be calculated based on mapped reads to HPV types not included in each sample. HPV genotyping was reproducible for all 9 types at 625 copies using multiple cut-off criteria but LOD was 25 copies based on number of reads above LOB even when multiple types were present. eWGS showed no bias for HPV genotyping under single or multiple infection (p = 0.16–0.99). Conclusions The universal eWGS method for HPV genotyping has sensitivity, competitive with widely used consensus PCR methods with reduced type competition, and with the potential for determination of variant and integration status. The protocol used in this study, using defined samples varying in complexity and copy number, analyzed in replicate and duplicate assays, is applicable to most WGS methods.
Keywords