Iranian Journal of Microbiology (Apr 2021)

Bioinformatic analysis of the whole genome sequences of SARS-CoV-2 from Indonesia

  • Maria Ulfah,
  • Is Helianti

DOI
https://doi.org/10.18502/ijm.v13i2.5973
Journal volume & issue
Vol. 13, no. 2

Abstract

Read online

Background and Objectives: In first May 2020, Indonesia has been successfully submitted the first three full-length sequence of SARS-CoV-2 to GISAID database. Until September 10th, 2020, Indonesia had submitted 54 WGS. In this study, we have analyzed and annotated SARS-CoV-2 mutations in spike protein and main proteases. Materials and Methods: The Whole Genome Sequence (WGS) of Indonesia were obtained from GISAID data base. The 54 data were taken from March to September 10th, 2020. The sequences corresponded to Spike Protein (SP), 3-chymotrypsin like protease (3CLpro), and papain like protease (PLpro) were selected. The Wuhan genome was used as reference. Results: In total WGS from Indonesia, we found 5 major clades, which dominated as G clade, where the mutation of D614G was found. This D614G was identified as much as 59%, which mostly reported in late samples submitted. Beside D614G mutation, we report three unique mutations: A352S, S477I, and Q677H. Besides, some mutations were also detected in two domains that were expected to be conserved region, the main viral proteases: PLpro (P77L and V205I), 3CLpro (M49I and L50F). Conclusion: The analysis of SARS-CoV-2 from WGS Indonesia showed a high genetic variation. The diversity in SARS-CoV-2 may epidemiologically enhance virulence and transmission of this virus. The prevalence of D614G over the time in different locations, indicating that changes in this mutation may related to host infection and the viral transmission. However, some mutations that have been reported in this study were not eligible for the most stable conformation.

Keywords