International Journal of Molecular Sciences (Oct 2023)

Whole-Genome Sequencing of 502 Individuals from Latvia: The First Step towards a Population-Specific Reference of Genetic Variation

  • Raimonds Reščenko,
  • Monta Brīvība,
  • Ivanna Atava,
  • Vita Rovīte,
  • Raitis Pečulis,
  • Ivars Silamiķelis,
  • Laura Ansone,
  • Kaspars Megnis,
  • Līga Birzniece,
  • Mārcis Leja,
  • Liqin Xu,
  • Xulian Shi,
  • Yan Zhou,
  • Andis Slaitas,
  • Yong Hou,
  • Jānis Kloviņš

DOI
https://doi.org/10.3390/ijms242015345
Journal volume & issue
Vol. 24, no. 20
p. 15345

Abstract

Read online

Despite rapid improvements in the accessibility of whole-genome sequencing (WGS), understanding the extent of human genetic variation is limited by the scarce availability of genome sequences from underrepresented populations. Developing the population-scale reference database of Latvian genetic variation may fill the gap in European genomes and improve human genomics research. In this study, we analysed a high-coverage WGS dataset comprising 502 individuals selected from the Genome Database of the Latvian Population. An assessment of variant type, location in the genome, function, medical relevance, and novelty was performed, and a population-specific imputation reference panel (IRP) was developed. We identified more than 18.2 million variants in total, of which 3.3% so far are not represented in gnomAD and dbSNP databases. Moreover, we observed a notable though distinct clustering of the Latvian cohort within the European subpopulations. Finally, our findings demonstrate the improved performance of imputation of variants using the Latvian population-specific reference panel in the Latvian population compared to established IRPs. In summary, our study provides the first WGS data for a regional reference genome that will serve as a resource for the development of precision medicine and complement the global genome dataset, improving the understanding of human genetic variation.

Keywords