PLoS Computational Biology (Jun 2023)

Analysis of high-dimensional metabolomics data with complex temporal dynamics using RM-ASCA.

  • Balázs Erdős,
  • Johan A Westerhuis,
  • Michiel E Adriaens,
  • Shauna D O'Donovan,
  • Ren Xie,
  • Cécile M Singh-Povel,
  • Age K Smilde,
  • Ilja C W Arts

DOI
https://doi.org/10.1371/journal.pcbi.1011221
Journal volume & issue
Vol. 19, no. 6
p. e1011221

Abstract

Read online

The intricate dependency structure of biological "omics" data, particularly those originating from longitudinal intervention studies with frequently sampled repeated measurements renders the analysis of such data challenging. The high-dimensionality, inter-relatedness of multiple outcomes, and heterogeneity in the studied systems all add to the difficulty in deriving meaningful information. In addition, the subtle differences in dynamics often deemed meaningful in nutritional intervention studies can be particularly challenging to quantify. In this work we demonstrate the use of quantitative longitudinal models within the repeated-measures ANOVA simultaneous component analysis+ (RM-ASCA+) framework to capture the dynamics in frequently sampled longitudinal data with multivariate outcomes. We illustrate the use of linear mixed models with polynomial and spline basis expansion of the time variable within RM-ASCA+ in order to quantify non-linear dynamics in a simulation study as well as in a metabolomics data set. We show that the proposed approach presents a convenient and interpretable way to systematically quantify and summarize multivariate outcomes in longitudinal studies while accounting for proper within subject dependency structures.