GigaByte (Oct 2024)

CannSeek? Yes we Can! An open-source single nucleotide polymorphism database and analysis portal for Cannabis sativa

  • Locedie Mansueto ,
  • Kenneth L. McNally ,
  • Tobias Kretzschmar ,
  • Ramil Mauleon

DOI
https://doi.org/10.46471/gigabyte.135

Abstract

Read online

A growing interest in Cannabis sativa uses for food, fiber, and medicine, and recent changes in regulations have spurred numerous genomic studies of this once-prohibited plant. Cannabis research uses Next Generation Sequencing technologies for genomics and transcriptomics. While other crops have genome portals enabling access and analysis of numerous genotyping data from diverse accessions, leading to the discovery of alleles for important traits, this is absent for cannabis. The CannSeek web portal aims to address this gap. Single nucleotide polymorphism datasets were generated by identifying genome variants from public resequencing data and genome assemblies. Results and accompanying trait data are hosted in the CannSeek web application, built using the Rice SNP-Seek infrastructure with improvements to allow multiple reference genomes and provide a web-service Application Programming Interface. The tools built into the portal allow phylogenetic analyses, varietal grouping and identifications, and favorable haplotype discovery for cannabis accessions using public sequencing data. Availability and implementation The CannSeek portal is available at https://icgrc.info/cannseek, https://icgrc.info/genotype_viewer.