Frontiers in Microbiology (May 2021)

HCK and ABAA: A Newly Designed Pipeline to Improve Fungi Metabarcoding Analysis

  • Kodjovi D. Mlaga,
  • Alban Mathieu,
  • Alban Mathieu,
  • Charles Joly Beauparlant,
  • Charles Joly Beauparlant,
  • Alban Ott,
  • Ahmad Khodr,
  • Olivier Perin,
  • Arnaud Droit,
  • Arnaud Droit

DOI
https://doi.org/10.3389/fmicb.2021.640693
Journal volume & issue
Vol. 12

Abstract

Read online

IntroductionThe fungi ITS sequence length dissimilarity, non-specific amplicons, including chimaera formed during Polymerase Chain Reaction (PCR), added to sequencing errors, create bias during similarity clustering and abundance estimation in the downstream analysis. To overcome these challenges, we present a novel approach, Hierarchical Clustering with Kraken (HCK), to classify ITS1 amplicons and Abundance-Base Alternative Approach (ABAA) pipeline to detect and filter non-specific amplicons in fungi metabarcoding sequencing datasets.Materials and MethodsWe compared the performances of both pipelines against QIIME, KRAKEN, and DADA2 using publicly available fungi ITS mock community datasets and using BLASTn as a reference. We calculated the Precision, Recall, F-score using the True-Positive, False-positive, and False-negative estimation. Alpha diversity (Chao1 and Shannon metrics) was also used to evaluate the diversity estimation of our method.ResultsThe analysis shows that ABAA reduced the number of false-positive with all metabarcoding methods tested, and HCK increases precision and recall. HCK, coupled with ABAA, improves the F-score and bring alpha diversity metric value close to that of the BLASTn alpha diversity values when compared to QIIME, KRAKEN, and DADA2.ConclusionThe developed HCK-ABAA approach allows better identification of the fungi community structures while avoiding use of a reference database for non-specific amplicons filtration. It results in a more robust and stable methodology over time. The software can be downloaded on the following link: https://bitbucket.org/GottySG36/hck/src/master/.

Keywords