Journal of Marine Science and Engineering (Dec 2022)
A CHEMTAX Study Based on Picoeukaryotic Phytoplankton Pigments and Next-Generation Sequencing Data from the Ulleungdo–Dokdo Marine System of the East Sea (Japan Sea): Improvement of Long-Unresolved Underdetermined Bias
Abstract
The CHEMTAX program has been widely used to estimate community composition based on major pigment concentrations in seawater. However, because CHEMTAX is an underdetermined optimization algorithm, underdetermined bias has remained an unsolved problem since its development in 1996. The risk of producing biased results increases when analyzing the picophytoplankton community; therefore, this study tested a new method for avoiding biased CHEMTAX results using the picophytoplankton community around the East Sea (Japan Sea). This method involves building a linear model between pigment concentration data and community composition data based on DNA sequencing to predict the pigment range for each operational taxonomic unit, based on the 95% prediction interval. Finally, the range data are transformed into an initial ratio and ratio limits for CHEMTAX analysis. Three combinations of initial ratios and ratio limits were tested to determine whether the modeled initial ratio and ratio limit could prevent underdetermined bias in the CHEMTAX estimates; these combinations were the modeled initial ratio and ratio limit, the modeled initial ratio with a default ratio limit of 500 s, and an initial ratio from previous research with the default ratio limit. The final ratio and composition data for each combination were compared with Bayesian compositional estimator-based final ratio and composition data, which are robust against underdetermined bias. Only CHEMTAX analysis using the modeled initial ratio and ratio limit was unbiased; all other combinations showed significant signs of bias. Therefore, the findings in this study indicate that ratio limits and the initial ratio are equally important in the CHEMTAX analysis of biased datasets. Moreover, we obtained statistically supported initial ratios and ratio limits through linear modeling of pigment concentrations and 16s rDNA composition data.
Keywords