Data in Brief (Dec 2022)
Heavy metal tolerance in Scopelophila cataractae: Transcriptomic and epigenetic datasets
Abstract
Studying how different plant groups deal with heavy metal exposure is crucial to improve our understanding of the diversity of molecular mechanisms involved in plant stress response. Here, we used RNA sequencing (RNA-seq) and epigenotyping by sequencing (epiGBS) to assess gene expression and DNA methylation changes respectively in plants from four populations of the metallophyte moss Scopelophila cataractae treated with Cd or Cu in the laboratory. We built RNA-seq and epiGBS sequencing libraries from control and treated samples from each population and sequenced them using Illumina HiSeq 3000 (PE-150 bp) and Illumina HiSeq X-Ten System (PE-150 bp) respectively. For the RNA-seq data, we performed a read quality filter, mapped the reads to the de novo transcriptome created with Trinity, and estimated transcript abundance for each sample. For the epiGBS data, we used a custom pipeline (https://doi.org/10.5281/zenodo.7040291) to map the reads to a de novo reference genome and performed strand-specific nucleotide (single nucleotide polymorphisms, SNPs) and methylation (single cytosine methylation polymorphisms, SMPs) variant calling. We filtered out SNPs and SMPs with low coverage within (positions with <10 sequencing reads per sample) and across samples (positions with poor representation on the full set of samples). Finally, we performed pairwise comparisons between control and treated samples from each population and identified differentially expressed genes and differentially methylated cytosines associated to heavy metal exposure. We payed particular attention to the different responses of the more and the less tolerant populations of S. cataractae. These datasets could contribute to future comparative studies of abiotic stress response across plant groups.