Journal of Open Research Software (Mar 2019)

'gcamdata': An R Package for Preparation, Synthesis, and Tracking of Input Data for the GCAM Integrated Human-Earth Systems Model

  • Ben Bond-Lamberty,
  • Kalyn Dorheim,
  • Ryna Cui,
  • Russell Horowitz,
  • Abigail Snyder,
  • Katherine Calvin,
  • Leyang Feng,
  • Rachel Hoesly,
  • Jill Horing,
  • G. Page Kyle,
  • Robert Link,
  • Pralit Patel,
  • Christopher Roney,
  • Aaron Staniszewski,
  • Sean Turner,
  • Min Chen,
  • Felip Feijoo,
  • Corinne Hartin,
  • Mohamad Hejazi,
  • Gokul Iyer,
  • Sonny Kim,
  • Yaling Liu,
  • Cary Lynch,
  • Haewon McJeon,
  • Steven Smith,
  • Stephanie Waldhoff,
  • Marshall Wise,
  • Leon Clarke

DOI
https://doi.org/10.5334/jors.232
Journal volume & issue
Vol. 7, no. 1

Abstract

Read online

The increasing data requirements of complex models demand robust, reproducible, and transparent systems to track and prepare models’ inputs. Here we describe version 1.0 of the gcamdata R package that processes raw inputs to produce the hundreds of XML files needed by the GCAM integrated human-earth systems model. It features extensive functional and unit testing, data tracing and visualization, and enforces metadata, documentation, and flexibility in its component data-processing subunits. Although this package is specific to GCAM, many of its structural pieces and approaches should be broadly applicable to, and reusable by, other complex model/data systems aiming to improve transparency, reproducibility, and flexibility. Funding statement: Primary support for this work was provided by the U.S. Department of Energy, Office of Science, as part of research in Multi-Sector Dynamics, Earth and Environmental System Modeling Program. Additional support was provided by the U.S. Department of Energy Offices of Fossil Energy, Nuclear Energy, and Energy Efficiency and Renewable Energy and the U.S. Environmental Protection Agency.

Keywords