Data in Brief (Jun 2024)

Genome features and carbohydrate-active enzymes repertoire of a novel Stenotrophomonas sepilia Alg010 strain isolated from Sargassum seaweed waste

  • Bidyut R. Mohapatra

Journal volume & issue
Vol. 54
p. 110533

Abstract

This study reports the genome sequence data of a novel Stenotrophomonas sepilia strain, Alg010, isolated from Sargassum seaweed waste accumulated on the coastline of Barbados. The genomic DNA of the isolate was sequenced on the Illumina NextSeq2000 platform using a paired-end library preparation protocol. The resulting reads were assembled with the SPAdes Genome Assembler (ver 3.15.4) and annotated with the DDBJ Fast Annotation and Submission Tool. The assembled genome is 4,515,447 bp in size, with a coverage of 270×, a GC content of 66.6 % and a gap ratio of 0.027 %. The longest contig is 246,749 bp and the N50 contig length is 81,982 bp. The assembly comprises 86 contigs and contains 2 rRNA, 66 tRNA, 2 CRISPR and 4,024 CDSs (coding sequences), with a coding ratio of 88.9 %. Annotation of the CDSs showed that metabolism was the most dominant COG (cluster of orthologous groups) category and that amino acids and derivatives was the most dominant subsystem feature category. Annotation of the genome with the dbCAN3 server for carbohydrate-active genes revealed 98 genes encoding the six functional classes of carbohydrate-active enzymes. The genome sequence data is available in NCBI GenBank under the accession number BTRJ00000000.
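For readers less familiar with the assembly statistics quoted above, the following minimal Python sketch shows how an N50 contig length and a GC percentage are conventionally computed from a set of contigs. It is an illustration only, not the pipeline used in the study: the function names `n50` and `gc_content` and the toy contig data are hypothetical, and the values reported in the abstract came from the authors' own assembly and annotation tools.

```python
def n50(lengths):
    """Return the N50: the length of the contig at which the running sum
    of contig lengths, taken from longest to shortest, first reaches at
    least half of the total assembly length."""
    if not lengths:
        raise ValueError("at least one contig length is required")
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if running * 2 >= total:
            return length


def gc_content(sequences):
    """Return the percentage of G and C bases across all sequences."""
    total_bases = sum(len(seq) for seq in sequences)
    if total_bases == 0:
        raise ValueError("sequences must contain at least one base")
    gc_bases = sum(seq.upper().count("G") + seq.upper().count("C")
                   for seq in sequences)
    return 100.0 * gc_bases / total_bases


# Toy data (hypothetical, not taken from the Alg010 assembly).
toy_lengths = [10, 8, 6, 4, 2]
toy_contigs = ["GGCC", "ATGC"]

print(n50(toy_lengths))         # N50 of the toy contig lengths
print(gc_content(toy_contigs))  # GC percentage of the toy contigs
```

Applied to the real assembly, the same definitions would be evaluated over the lengths and sequences of all 86 contigs.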

Keywords