Earth System Science Data (Jul 2024)

Gap-filling techniques applied to the GOCI-derived daily sea surface salinity product for the Changjiang diluted water front in the East China Sea

  • J. Shin,
  • D.-W. Kim,
  • D.-W. Kim,
  • S.-H. Kim,
  • G. S. Lee,
  • B.-K. Khim,
  • B.-K. Khim,
  • Y.-H. Jo,
  • Y.-H. Jo

DOI
https://doi.org/10.5194/essd-16-3193-2024
Journal volume & issue
Vol. 16
pp. 3193 – 3211

Abstract

Read online

The spatial and temporal resolutions of contemporary microwave-based sea surface salinity (SSS) measurements are insufficient. Thus, we developed a gap-free gridded daily SSS product with higher spatial and temporal resolutions, which can provide information on short-term variability in the East China Sea (ECS), such as the front changes by Changjiang diluted water (CDW). Specifically, we conducted gap-filling for daily SSS products based on the Geostationary Ocean Color Imager (GOCI) with a spatial resolution of 1 km (0.01°), using a machine learning approach during the summer seasons from 2015 to 2019. The comparison of the Soil Moisture Active Passive (SMAP), Copernicus Marine Environment Monitoring Service (CMEMS), and Hybrid Coordinate Ocean Model (HYCOM) SSS products with the GOCI-derived SSS over the entire SSS range showed that the SMAP SSS was highly consistent, whereas the HYCOM SSS was the least consistent. In the < 31 psu range, the SMAP SSS was still the most consistent with the GOCI-derived SSS (R2=0.46; root mean squared error: RMSE = 2.41 psu); in the > 31 psu range, the CMEMS and HYCOM SSS products showed similar levels of agreement with that of the SMAP SSS. We trained and tested three machine learning models – the fine trees, boosted trees, and bagged trees models – using the daily GOCI-derived SSS as output, including the three SSS products, environmental variables, and geographical data. We combined the three SSS products to construct input datasets for machine learning. Using the test dataset, the bagged trees model showed the best results (mean R2=0.98 and RMSE = 1.31 psu), and the models that used the SMAP SSS as input had the highest level. For the dataset in the > 31 psu range, all the models exhibited similarly reasonable performances (RMSE = 1.25–1.35 psu). The comparison with in situ SSS data, time series analysis, and the spatial SSS distribution derived from models showed that all the models had proper CDW distributions with reasonable RMSE levels (0.91–1.56 psu). In addition, the CDW front derived from the model gap-free daily SSS product clearly demonstrated the daily oceanic mechanism during the summer season in the ECS at a detailed spatial scale. Notably, the CDW front in the zonal direction, as captured by the Ieodo Ocean Research Station (I-ORS), moved approximately 3.04 km d−1 in 2016, which is very fast compared with the cases in other years. Our model yielded a gap-free gridded daily SSS product with reasonable accuracy and enabled the successful recognition of daily SSS fronts at the 1 km level, which was previously not possible with ocean color data. Such successful application of machine learning models can further provide useful information on the long-term variation of daily SSS in the ECS. The gridded gap-free SSS dataset at 0.01°×0.01° spatial resolution is freely available at https://doi.org/10.22808/DATA-2023-2 (Shin et al., 2023).