Earth System Science Data (Nov 2023)

A global daily gap-filled chlorophyll-<i>a</i> dataset in open oceans during 2001–2021 from multisource information using convolutional neural networks

  • Z. Hong,
  • Z. Hong,
  • D. Long,
  • D. Long,
  • X. Li,
  • X. Li,
  • Y. Wang,
  • Y. Wang,
  • J. Zhang,
  • J. Zhang,
  • M. A. Hamouda,
  • M. A. Hamouda,
  • M. M. Mohamed,
  • M. M. Mohamed

DOI
https://doi.org/10.5194/essd-15-5281-2023
Journal volume & issue
Vol. 15
pp. 5281 – 5300

Abstract

Read online

Ocean color data are essential for developing our understanding of biological and ecological phenomena and processes and also of important sources of input for physical and biogeochemical ocean models. Chlorophyll-a (Chl-a) is a critical variable of ocean color in the marine environment. Quantitative retrieval from satellite remote sensing is a main way to obtain large-scale oceanic Chl-a. However, missing data are a major limitation in satellite remote-sensing-based Chl-a products due mostly to the influence of cloud, sun glint contamination, and high satellite viewing angles. The common methods to reconstruct (gap fill) missing data often consider spatiotemporal information of initial images alone, such as Data Interpolating Empirical Orthogonal Functions, optimal interpolation, Kriging interpolation, and the extended Kalman filter. However, these methods do not perform well in the presence of large-scale missing values in the image and overlook the valuable information available from other datasets for data reconstruction. Here, we developed a convolutional neural network (CNN) named Ocean Chlorophyll-a concentration reconstruction by convolutional neural NETwork (OCNET) for Chl-a concentration data reconstruction in open-ocean areas, considering environmental variables that are associated with ocean phytoplankton growth and distribution. Sea surface temperature (SST), salinity (SAL), photosynthetically active radiation (PAR), and sea surface pressure (SSP) from reanalysis data and satellite observations were selected as the input of OCNET to correlate with the environment and phytoplankton biomass. The developed OCNET model achieves good performance in the reconstruction of global open ocean Chl-a concentration data and captures spatiotemporal variations of these features. The reconstructed Chl-a data are available online at https://doi.org/10.5281/zenodo.10011908 (Hong et al., 2023). This study also shows the potential of machine learning in large-scale ocean color data reconstruction and offers the possibility of predicting Chl-a concentration trends in a changing environment.