Data in Brief (Jun 2023)
Draft genome sequence data of Enterococcus faecium R9, a multiple enterocins-producing strain
Abstract
Food contamination by pathogens results in serious health problems and economic losses. Chemical food preservatives pose a risk to human health. To increase the shelf life of products and prevent spoilage, the dairy sector is considering natural preservatives such as bacteriocins, which are ribosomally synthesized peptides. Here we present the draft genome sequence of Enterococcus faecium strain R9, a strain isolated from raw camel milk that produces three bacteriocins. These bacteriocins showed valuable technological properties, such as sensitivity to proteolytic enzymes, heat stability, and tolerance of a wide pH range. Paired-end sequencing (2 × 250 bp) was performed on an Illumina HiSeq 2500 platform. The genome sequence consisted of 3,598,862 bases with a GC content of 37.94%. The number of raw reads was 4,670,510, and the assembly had an N50 of 65,355 bp and an average coverage of 310.28×. A total of 3,086 coding sequences (CDSs) were predicted, of which 2,126 had a known function and 127 carried a signal peptide. Annotation of the genome sequence revealed bacteriocin-encoding genes, namely those for enterocin B, enterocin P, and the two-component enterocin X (X-alpha and X-beta subunits). These enterocins are beneficial for controlling Listeria monocytogenes in the food industry. The genome sequence of Enterococcus faecium R9 has been deposited in GenBank under accession number JALJED000000000 and is available in Mendeley Data [1].
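For readers unfamiliar with the assembly statistics quoted above, the following is a minimal Python sketch of how GC content and N50 are conventionally defined. It is not the pipeline used to produce the reported values. The function names `gc_content` and `n50` and the toy contigs are hypothetical and serve only as illustration.

```python
def gc_content(sequences):
    """Return the percentage of G and C bases across all sequences."""
    total = sum(len(seq) for seq in sequences)
    if total == 0:
        return 0.0
    gc = sum(seq.upper().count("G") + seq.upper().count("C") for seq in sequences)
    return 100.0 * gc / total


def n50(lengths):
    """Return the contig length L such that contigs of length >= L
    together cover at least half of the total assembly length."""
    ordered = sorted(lengths, reverse=True)
    half = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half:
            return length
    return 0  # empty assembly


# Toy contigs (hypothetical, not taken from the R9 assembly)
contigs = ["ATGCGC", "ATAT", "GG"]
print(gc_content(contigs))              # GC percentage of the toy assembly
print(n50([len(c) for c in contigs]))   # N50 of the toy assembly
```

In practice these statistics would be computed over the contigs of the deposited assembly, typically by an assembly-evaluation tool rather than by hand.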