iScience (Aug 2023)
Intragenomic diversity of the V9 hypervariable domain in eukaryotes has little effect on metabarcoding
Abstract
Summary: Metabarcoding has revolutionized our understanding of the diversity and ecology of microorganisms in different habitats. However, it carries several inherent biases, one of which stems from intragenomic diversity of the molecular barcode. Here, we compare intragenomic variability of the V9 region of the 18S rRNA gene in 19 eukaryotic phyla abundant in marine plankton. The level of intragenomic variability is comparable across all the phyla, and in most genomes and transcriptomes one V9 sequence and one OTU are predominant. However, most of the variability observed at the barcode level is probably caused by sequencing errors and is mitigated by using a denoising tool, DADA2. The SWARM algorithm, commonly used in metabarcoding studies, is not optimal for collapsing genuine and erroneous sequences into a single OTU, leading to an overestimation of diversity in metabarcoding data. For an unknown reason, SWARM inflates the diversity of eupelagonemids more than that of other eukaryotes.