Journal of Infection and Public Health (Aug 2022)

A comprehensive genomic study, mutation screening, phylogenetic and statistical analysis of SARS-CoV-2 and its variant omicron among different countries

  • Syed Umair Ahmad,
  • Bushra Hafeez Kiani,
  • Muhammad Abrar,
  • Zainab Jan,
  • Imran Zafar,
  • Yasir Ali,
  • Amer M. Alanazi,
  • Abdul Malik,
  • Mohd Ashraf Rather,
  • Asrar Ahmad,
  • Azmat Ali Khan

Journal volume & issue
Vol. 15, no. 8
pp. 878 – 891

Abstract

Read online

Background: With the rapid development of the genomic sequence data for the Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) and its variants Delta (B.1.617.2) and Omicron (B.1.1.529), it is vital to successfully identify mutations within the genome. Objective: The main objective of the study is to investigate the full-length genome mutation analysis of 157 SARS-CoV-2 and its variant Delta and Omicron isolates. This study also provides possible effects at the structural level to understand the role of mutations and new insights into the evolution of COVID-19 and evaluates the differential level analysis in viral genome sequence among different nations. We have also tried to offer a mutation snapshot for these differences that could help in vaccine formulation. This study utilizes a unique and efficient method of targeting the stable genes for the drug discovery approach. Methods: Complete genome sequence information of SARS-CoV-2, Delta, and Omicron from online resources were used to predict structure domain identification, data mining, and screening; employing different bioinformatics tools. BioEdit software was used to perform their genomic alignments across countries and a phylogenetic tree as per the confidence of 500 bootstrapping values was constructed. Heterozygosity ratios were determined in-silico. A minimum spanning network (MSN) of selected populations was determined by Bruvo's distance role-based framework. Results: Out of all 157 different strains of SARS-CoV-2 and its variants, and their complete genome sequences from different countries, Corona nucleoca and DUF5515 were observed to be the most conserved domains. All genomes obtained changes in comparison to the Wuhan-Hu-1 strain, mainly in the TRS region (CUAAAC or ACGAAC). We discovered 596 mutations in all genes, with the highest number (321) found in ORF1ab (QHD43415.1), or TRS site mutations found only in ORF7a (1) and ORF10 (2). The Omicron variant has 30 mutations in the Spike protein and has a higher alpha-helix shape (23.46%) than the Delta version (22.03%). T478 was also discovered to be a prevalent polymorphism in Delta and Omicron variations, as well as genomic gaps ranging from 45 to 65aa. All 157 sequences contained variations and conformed to Nei's Genetic distance. We discovered heterozygosity (Hs) 0.01, mean anticipated Hs 0.32, the genetic diversity index (GDI) 0.01943989, and GD within population 0.01266951. The Hedrick value was 0.52324978, the GD coefficient was 0.52324978, the average Hs was 0.01371452, and the GD coefficient was 0.52324978. Among other countries, Brazil has the highest standard error (SE) rate (1.398), whereas Japan has the highest ratio of Nei's gene diversity (0.01). Conclusions: The study's findings will assist in comprehending the shape and kind of complete genome, their streaming genomic sequences, and mutations in various additions of SARS-CoV-2, as well as its different variant strains like Omicron. These results will provide a scientific basis to design the vaccines and understand the genomic study of these viruses.

Keywords