Wellcome Open Research (Aug 2020)

Integrated genomic view of SARS-CoV-2 in India [version 1; peer review: 2 approved]

  • Pramod Kumar,
  • Rajesh Pandey,
  • Pooja Sharma,
  • Mahesh S. Dhar,
  • Vivekanand A.,
  • Bharathram Uppili,
  • Himanshu Vashisht,
  • Saruchi Wadhwa,
  • Nishu Tyagi,
  • Saman Fatihi,
  • Uma Sharma,
  • Priyanka Singh,
  • Hemlata Lall,
  • Meena Datta,
  • Poonam Gupta,
  • Nidhi Saini,
  • Aarti Tewari,
  • Bibhash Nandi,
  • Dhirendra Kumar,
  • Satyabrata Bag,
  • Deepanshi Gahlot,
  • Surabhi Rathore,
  • Nidhi Jatana,
  • Varun Jaiswal,
  • Hema Gogia,
  • Preeti Madan,
  • Simrita Singh,
  • Prateek Singh,
  • Debasis Dash,
  • Manju Bala,
  • Sandhya Kabra,
  • Sujeet Singh,
  • Mitali Mukerji,
  • Lipi Thukral,
  • Mohammed Faruq,
  • Anurag Agrawal,
  • Partha Rakshit

DOI
https://doi.org/10.12688/wellcomeopenres.16119.1
Journal volume & issue
Vol. 5

Abstract

Read online

Background: India first detected SARS-CoV-2, causal agent of COVID-19 in late January 2020, imported from Wuhan, China. From March 2020 onwards, the importation of cases from countries in the rest of the world followed by seeding of local transmission triggered further outbreaks in India. Methods: We used ARTIC protocol-based tiling amplicon sequencing of SARS-CoV-2 (n=104) from different states of India using a combination of MinION and MinIT sequencing from Oxford Nanopore Technology to understand how introduction and local transmission occurred. Results: The analyses revealed multiple introductions of SARS-CoV-2 genomes, including the A2a cluster from Europe and the USA, A3 cluster from Middle East and A4 cluster (haplotype redefined) from Southeast Asia (Indonesia, Thailand and Malaysia) and Central Asia (Kyrgyzstan). The local transmission and persistence of genomes A4, A2a and A3 was also observed in the studied locations. The most prevalent genomes with patterns of variance (confined in a cluster) remain unclassified, and are here proposed as A4-clade based on its divergence within the A cluster. Conclusions: The viral haplotypes may link their persistence to geo-climatic conditions and host response. Multipronged strategies including molecular surveillance based on real-time viral genomic data is of paramount importance for a timely management of the pandemic.