Frontiers in Plant Science (Feb 2012)

Metabolomics as a hypothesis-generating functional genomics tool for the annotation of Arabidopsis thaliana genes of "unknown function"

  • Stephanie Michelle Moon Quanbeck,
  • Libuse eBrachova,
  • Alexis A. Campbell,
  • Xin eGuan,
  • Ann ePerera,
  • Kun eHe,
  • Seung Y. Rhee,
  • Preeti eBais,
  • Julie eDickerson,
  • Philip eDixon,
  • Gert eWohlgemuth,
  • Oliver eFiehn,
  • Lenore eBarkan,
  • B. Markus eLange,
  • Insuk eLee,
  • Diego eCortes,
  • Carolina eSalazar,
  • Joel eShuman,
  • Vladimir eShulaev,
  • Vladimir eShulaev,
  • David eHuhman,
  • Lloyd W. Sumner,
  • Mary R. Roth,
  • Ruth eWelti,
  • Hilal eIlarslan,
  • Eve S. Wurtele,
  • Basil J. Nikolau

DOI
https://doi.org/10.3389/fpls.2012.00015
Journal volume & issue
Vol. 3

Abstract

Read online

Metabolomics is the methodology that identifies and measures global pools of small molecules (of less than about 1,000 Daltons) of a biological sample, which are collectively called the metabolome. Metabolomics can therefore reveal the metabolic outcome of a genetic or environmental perturbation of a metabolic regulatory network, and thus provide insights into the structure and regulation of that network. Because of the chemical complexity of the metabolome and limitations associated with individual analytical platforms for determining the metabolome, it is currently difficult to capture the complete metabolome of an organism or tissue, which is in contrast to genomics and transcriptomics. This paper describes the analysis of Arabidopsis metabolomics data sets acquired by a consortium that includes five analytical laboratories, bioinformatists and biostatisticians, which aims to develop and validate metabolomics as a hypothesis-generating functional genomics tool. The consortium is determining the metabolomes of Arabidopsis T-DNA mutant stocks, grown in standardized controlled environment optimized to minimize environmental impacts on the metabolomes. Metabolomics data were generated with seven analytical platforms, and the combined data is being provided to the research community to formulate initial hypotheses about genes of unknown function. A public database (www.PlantMetabolomics.org) has been developed to provide the scientific community with access to the data along with tools to allow for its interactive analysis. Exemplary datasets are discussed to validate the approach, which illustrate how initial hypotheses can be generated from the consortium-produced metabolomics data, integrated with prior knowledge to provide a testable hypothesis concerning the functionality of genes of unknown functions.

Keywords