Data in Brief (Oct 2020)
Seasonal Ely Copper Mine Superfund site shotgun metagenomic and metatranscriptomic data analysis
Abstract
High throughput sequencing data collected from acid rock drainage (ARD) communities can reveal the active taxonomic and functional diversity of these extreme environments, which can be exploited for bioremediation, pharmaceutical, and industrial applications. Here, we report a seasonal comparison of a microbiome and transcriptome in Ely Brook (EB-90M), a confluence of clean water and upstream tributaries that drains the Ely Copper Mine Superfund site in Vershire, VT, USA. Nucleic acids were extracted from EB-90M water and sediment followed by shotgun sequencing using the Illumina NextSeq platform. Approximately 575,933 contigs with a total length of 1.54 Gbp were generated. Contigs of at least a size of 3264 (N50) or greater represented 50% of the sequences and the longest contig was 488,568 bp in length. Using Centrifuge against the NCBI “nt” database 141 phyla, including candidate phyla, were detected. Roughly 380,000 contigs were assembled and ∼1,000,000 DNA and ∼550,000 cDNA sequences were identified and functionally annotated using the Prokka pipeline. Most expressed KEGG-annotated microbial genes were involved in amino acid metabolism and several KEGG pathways were differentially expressed between seasons. Biosynthetic gene clusters involved in secondary metabolism as well as metal- and antibiotic-resistance genes were annotated, some of which were differentially expressed, colocalized, and coexpressed. These data can be used to show how ecological stimuli, such as seasonal variations and metal concentrations, affect the ARD microbiome and select taxa to produce novel natural products. The data reported herein is supporting information for the research article “Characterization of an acid rock drainage microbiome and transcriptome at the Ely Copper Mine Superfund site” by Giddings et al. [1].