Computational and Structural Biotechnology Journal (Jan 2023)
VTAM: A robust pipeline for validating metabarcoding data using controls
Abstract
To obtain accurate estimates for biodiversity and ecological studies, metabarcoding studies should be carefully designed to minimize both false positive (FP) and false negative (FN) occurrences. Internal controls (mock samples and negative controls), replicates, and overlapping markers allow controlling metabarcoding errors but current metabarcoding software packages do not explicitly integrate these additional experimental data to optimize filtering. We have developed the metabarcoding analysis software VTAM, which uses explicitly these elements of the experimental design to find optimal parameter settings that minimize FP and FN occurrences. VTAM showed similar sensitivity, but a higher precision compared to two other pipelines using three datasets and two different markers (COI, 16S). The stringent filtering procedure implemented in VTAM aims to produce robust metabarcoding data to obtain accurate ecological estimates and represents an important step towards a non-arbitrary and standardized validation of metabarcoding data for conducting ecological studies. VTAM is implemented in Python and available from: https://github.com/aitgon/vtam. The VTAM benchmark code is available from: https://github.com/aitgon/vtam_benchmark.