Geoscientific Model Development (Jan 2021)

Coordinating an operational data distribution network for CMIP6 data

  • R. Petrie,
  • S. Denvil,
  • S. Ames,
  • G. Levavasseur,
  • S. Fiore,
  • C. Allen,
  • F. Antonio,
  • K. Berger,
  • P.-A. Bretonnière,
  • L. Cinquini,
  • E. Dart,
  • P. Dwarakanath,
  • K. Druken,
  • B. Evans,
  • L. Franchistéguy,
  • S. Gardoll,
  • E. Gerbier,
  • M. Greenslade,
  • D. Hassell,
  • A. Iwi,
  • M. Juckes,
  • S. Kindermann,
  • L. Lacinski,
  • M. Mirto,
  • A. B. Nasser,
  • P. Nassisi,
  • E. Nienhouse,
  • S. Nikonov,
  • A. Nuzzo,
  • C. Richards,
  • S. Ridzwan,
  • M. Rixen,
  • K. Serradell,
  • K. Snow,
  • A. Stephens,
  • M. Stockhause,
  • H. Vahlenkamp,
  • R. Wagner

DOI
https://doi.org/10.5194/gmd-14-629-2021
Journal volume & issue
Vol. 14
pp. 629 – 644

Abstract

Data contributed to the Coupled Model Intercomparison Project Phase 6 (CMIP6) are distributed via the Earth System Grid Federation (ESGF). The ESGF is a network of internationally distributed sites that together work as a federated data archive. Data records from climate modelling institutes are published to the ESGF and then shared around the world. It is anticipated that CMIP6 will produce approximately 20 PB of data to be published and distributed via the ESGF. In addition to this large volume of data, a number of value-added CMIP6 services are required to interact with the ESGF; for example, the citation and errata services both interact with the ESGF but are not a core part of its infrastructure. Because of the number of interacting services and the large volume of data anticipated for CMIP6, the CMIP Data Node Operations Team (CDNOT) was formed. The CDNOT coordinated and implemented a series of CMIP6 preparation data challenges to test all the interacting components in the ESGF CMIP6 software ecosystem. This ensured that when CMIP6 data were released they could be reliably distributed.