Scientific Reports (Feb 2021)
A target enrichment high throughput sequencing system for characterization of BLV whole genome sequence, integration sites, clonality and host SNP
Abstract
Abstract Bovine leukemia virus (BLV) is an oncogenic retrovirus which induces malignant lymphoma termed enzootic bovine leukosis (EBL) after a long incubation period. Insertion sites of the BLV proviral genome as well as the associations between disease progression and polymorphisms of the virus and host genome are not fully understood. To characterize the biological coherence between virus and host, we developed a DNA-capture-seq approach, in which DNA probes were used to efficiently enrich target sequence reads from the next-generation sequencing (NGS) library. In addition, enriched reads can also be analyzed for detection of proviral integration sites and clonal expansion of infected cells since the reads include chimeric reads of the host and proviral genomes. To validate this DNA-capture-seq approach, a persistently BLV-infected fetal lamb kidney cell line (FLK-BLV), four EBL tumor samples and four non-EBL blood samples were analyzed to identify BLV integration sites. The results showed efficient enrichment of target sequence reads and oligoclonal integrations of the BLV proviral genome in the FLK-BLV cell line. Moreover, three out of four EBL tumor samples displayed multiple integration sites of the BLV proviral genome, while one sample displayed a single integration site. In this study, we found the evidence for the first time that the integrated provirus defective at the 5′ end was present in the persistent lymphocytosis cattle. The efficient and sensitive identification of BLV variability, integration sites and clonal expansion described in this study provide support for use of this innovative tool for understanding the detailed mechanisms of BLV infection during the course of disease progression.