Increasing the Reproducibility and Replicability of Supervised AI/ML in the Earth Systems Science by Leveraging Social Science Methods

Christopher D. Wirz; Carly Sutter; Julie L. Demuth; Kirsten J. Mayer; William E. Chapman; Mariana Goodall Cains; Jacob Radford; Vanessa Przybylo; Aaron Evans; Thomas Martin; Lauriana C. Gaudet; Kara Sulia; Ann Bostrom; David John Gagne II; Nick Bassill; Andrea Schumacher; Christopher Thorncroft

doi:10.1029/2023EA003364

Earth and Space Science (Jul 2024)

Affiliations

Christopher D. Wirz: NSF National Center for Atmospheric Research Boulder CO USA
Carly Sutter: University at Albany SUNY Albany NY USA
Julie L. Demuth: NSF National Center for Atmospheric Research Boulder CO USA
Kirsten J. Mayer: NSF National Center for Atmospheric Research Boulder CO USA
William E. Chapman: NSF National Center for Atmospheric Research Boulder CO USA
Mariana Goodall Cains: NSF National Center for Atmospheric Research Boulder CO USA
Jacob Radford: NSF National Center for Atmospheric Research Boulder CO USA
Vanessa Przybylo: University at Albany SUNY Albany NY USA
Aaron Evans: University at Albany SUNY Albany NY USA
Thomas Martin: NSF Unidata Boulder CO USA
Lauriana C. Gaudet: The Weather Company Andover MA USA
Kara Sulia: University at Albany SUNY Albany NY USA
Ann Bostrom: University of Washington Seattle WA USA
David John Gagne II: NSF National Center for Atmospheric Research Boulder CO USA
Nick Bassill: University at Albany SUNY Albany NY USA
Andrea Schumacher: NSF National Center for Atmospheric Research Boulder CO USA
Christopher Thorncroft: University at Albany SUNY Albany NY USA

DOI: https://doi.org/10.1029/2023EA003364
Journal volume & issue: Vol. 11, no. 7

Abstract

Artificial intelligence (AI) and machine learning (ML) pose a challenge for achieving science that is both reproducible and replicable. The challenge is compounded in supervised models that depend on manually labeled training data, because hand labeling introduces additional decisions and processes that require thorough documentation and reporting. We address these limitations by providing an approach to hand labeling training data for supervised ML that integrates quantitative content analysis (QCA), a method from social science research. The QCA approach provides a rigorous and well-documented hand labeling procedure that improves the replicability and reproducibility of supervised ML applications in Earth systems science (ESS), as well as the ability to evaluate them. Specifically, the approach requires (a) the articulation and documentation, in a “codebook,” of the exact decision-making process used for assigning hand labels and (b) an empirical evaluation of the “reliability” of the hand labelers. In this paper, we outline the contributions of QCA to the field, along with an overview of the general approach. We then provide a case study to further demonstrate how this framework has been and can be applied when developing supervised ML models for applications in ESS. With this approach, we provide an actionable path forward for addressing ethical considerations and goals outlined by recent AGU work on ML ethics in ESS.
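As an illustration of requirement (b), the sketch below computes Cohen's kappa, one common chance-corrected agreement statistic for two labelers. The abstract does not say which reliability statistic the authors use, so the choice of kappa, the function name `cohens_kappa`, and the example labels are all assumptions made here for illustration only.

```python
from collections import Counter


def cohens_kappa(labels_a, labels_b):
    """Chance-corrected agreement between two labelers (Cohen's kappa).

    Illustrative helper, not taken from the paper.
    """
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("Both labelers must label the same non-empty set of items.")

    n = len(labels_a)

    # Observed agreement: fraction of items given the same label by both labelers.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n

    # Expected agreement by chance, from each labeler's marginal label frequencies.
    counts_a = Counter(labels_a)
    counts_b = Counter(labels_b)
    expected = sum(counts_a[c] * counts_b[c] for c in counts_a) / (n * n)

    if expected == 1.0:
        # Both labelers used a single identical category for every item.
        return 1.0
    return (observed - expected) / (1.0 - expected)


# Hypothetical hand labels from two labelers for the same eight items.
coder_1 = ["snow", "snow", "clear", "clear", "snow", "clear", "clear", "snow"]
coder_2 = ["snow", "clear", "clear", "clear", "snow", "clear", "snow", "snow"]

kappa = cohens_kappa(coder_1, coder_2)
print(f"Cohen's kappa: {kappa:.2f}")
```

In this made-up example the labelers agree on 6 of 8 items (0.75), while chance agreement from their label frequencies is 0.5, giving a kappa of 0.5. Reliability statistics of this kind separate real agreement from agreement that would occur by chance alone. With more than two labelers or missing labels, a different statistic would be needed.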

Published in Earth and Space Science

ISSN: 2333-5084 (Online)
Publisher: American Geophysical Union (AGU)
Country of publisher: United States
LCC subjects: Science: Astronomy; Science: Geology
Website: https://agupubs.onlinelibrary.wiley.com/journal/23335084
