COVID (Apr 2023)
Genetic Analysis and Epitope Prediction of SARS-CoV-2 Genome in Bahia, Brazil: An In Silico Analysis of First and Second Wave Genomics Diversity
Abstract
COVID-19 is an infectious disease caused by SARS-CoV-2. This virus presents high levels of mutation and transmissibility, which contributed to the emergence of the pandemic. Our study aimed to analyze, in silico, the genomic diversity of SARS-CoV-2 strains in Bahia State by comparing patterns in variability of strains circulating in Brazil with the first isolated strain NC_045512 (reference sequence). Genomes were collected using GISAID, and subsequently aligned and compared using structural and functional genomic annotation. A total of 744 genomes were selected, and 20,773 mutations were found, most of which were of the SNP type. Most of the samples presented low mutational impact, and of the samples, the P.1 (360) lineage possessed the highest prevalence. The most prevalent epitopes were associated with the ORF1ab protein, and in addition to P.1, twenty-one other lineages were also detected during the study period, notably B.1.1.33 (78). The phylogenetic tree revealed that SARS-CoV-2 variants isolated from Bahia were clustered closely together. It is expected that the data collected will help provide a better epidemiological understanding of the COVID-19 pandemic (especially in Bahia), as well as helping to develop more effective vaccines that allow less immunogenic escape.
Keywords