MLAir (v1.0) – a tool to enable fast and flexible machine learning on air data time series

L. H. Leufen; L. H. Leufen; F. Kleinert; F. Kleinert; M. G. Schultz

doi:10.5194/gmd-14-1553-2021

Geoscientific Model Development (Mar 2021)

MLAir (v1.0) – a tool to enable fast and flexible machine learning on air data time series

L. H. Leufen,
L. H. Leufen,
F. Kleinert,
F. Kleinert,
M. G. Schultz

Affiliations

L. H. Leufen: Jülich Supercomputing Centre, Research Centre Jülich, Jülich, Germany
L. H. Leufen: Institute of Geosciences, Rhenish Friedrich Wilhelm University of Bonn, Bonn, Germany
F. Kleinert: Jülich Supercomputing Centre, Research Centre Jülich, Jülich, Germany
F. Kleinert: Institute of Geosciences, Rhenish Friedrich Wilhelm University of Bonn, Bonn, Germany
M. G. Schultz: Jülich Supercomputing Centre, Research Centre Jülich, Jülich, Germany

DOI: https://doi.org/10.5194/gmd-14-1553-2021
Journal volume & issue: Vol. 14
pp. 1553 – 1574

Abstract

Read online

With MLAir (Machine Learning on Air data) we created a software environment that simplifies and accelerates the exploration of new machine learning (ML) models, specifically shallow and deep neural networks, for the analysis and forecasting of meteorological and air quality time series. Thereby MLAir is not developed as an abstract workflow, but hand in hand with actual scientific questions. It thus addresses scientists with either a meteorological or an ML background. Due to their relative ease of use and spectacular results in other application areas, neural networks and other ML methods are also gaining enormous momentum in the weather and air quality research communities. Even though there are already many books and tutorials describing how to conduct an ML experiment, there are many stumbling blocks for a newcomer. In contrast, people familiar with ML concepts and technology often have difficulties understanding the nature of atmospheric data. With MLAir we have addressed a number of these pitfalls so that it becomes easier for scientists of both domains to rapidly start off their ML application. MLAir has been developed in such a way that it is easy to use and is designed from the very beginning as a stand-alone, fully functional experiment. Due to its flexible, modular code base, code modifications are easy and personal experiment schedules can be quickly derived. The package also includes a set of validation tools to facilitate the evaluation of ML results using standard meteorological statistics. MLAir can easily be ported onto different computing environments from desktop workstations to high-end supercomputers with or without graphics processing units (GPUs).

Published in Geoscientific Model Development

ISSN: 1991-959X (Print); 1991-9603 (Online)
Publisher: Copernicus Publications
Country of publisher: Germany
LCC subjects: Science: Geology
Website: https://www.geoscientific-model-development.net/

About the journal