Technology to aid the analysis of large-volume multi-institute climate model output at a central analysis facility (PRIMAVERA Data Management Tool V2.10)

J. Seddon; A. Stephens; M. S. Mizielinski; P. L. Vidale; M. J. Roberts

doi:10.5194/gmd-16-6689-2023

Geoscientific Model Development (Nov 2023)

Technology to aid the analysis of large-volume multi-institute climate model output at a central analysis facility (PRIMAVERA Data Management Tool V2.10)

J. Seddon,
A. Stephens,
M. S. Mizielinski,
P. L. Vidale,
M. J. Roberts

Affiliations

J. Seddon: Met Office Hadley Centre, FitzRoy Road, Exeter, EX1 3PB, UK
A. Stephens: Centre for Environmental Data Analysis, RAL Space, STFC Rutherford Appleton Laboratory, Harwell, Oxford, Didcot, OX11 0QX, UK
M. S. Mizielinski: Met Office Hadley Centre, FitzRoy Road, Exeter, EX1 3PB, UK
P. L. Vidale: NCAS-Climate, Department of Meteorology, University of Reading, Reading, RG6 6AH, UK
M. J. Roberts: Met Office Hadley Centre, FitzRoy Road, Exeter, EX1 3PB, UK

DOI: https://doi.org/10.5194/gmd-16-6689-2023
Journal volume & issue: Vol. 16
pp. 6689 – 6700

Abstract

Read online

The PRIMAVERA project aimed to develop a new generation of advanced and well-evaluated high-resolution global climate models. As part of PRIMAVERA, seven different climate models were run in both standard and higher-resolution configurations, with common initial conditions and forcings to form a multi-model ensemble. The ensemble simulations were run on high-performance computers across Europe and generated approximately 1.6 PiB (pebibytes) of output. To allow the data from all models to be analysed at this scale, PRIMAVERA scientists were encouraged to bring their analysis to the data. All data were transferred to a central analysis facility (CAF), in this case the JASMIN super-data-cluster, where it was catalogued and details made available to users using the web interface of the PRIMAVERA Data Management Tool (DMT). Users from across the project were able to query the available data using the DMT and then access it at the CAF. Here we describe how the PRIMAVERA project used the CAF's facilities to enable users to analyse this multi-model dataset. We believe that PRIMAVERA's experience using a CAF demonstrates how similar, multi-institute, big-data projects can efficiently share, organise and analyse large volumes of data.

Published in Geoscientific Model Development

ISSN: 1991-959X (Print); 1991-9603 (Online)
Publisher: Copernicus Publications
Country of publisher: Germany
LCC subjects: Science: Geology
Website: https://www.geoscientific-model-development.net/

About the journal