Earth System Science Data (Dec 2024)
Reprocessing of eXpendable BathyThermograph (XBT) profiles from the Ligurian and Tyrrhenian seas over the time period 1999–2019 with a full metadata upgrade
Abstract
The advent of open science and the United Nations Decade of Ocean Science for Sustainable Development are revolutionizing the ocean-data-sharing landscape for an efficient and transparent ocean information and knowledge generation. This blue revolution raised awareness on the importance of metadata and community standards to activate interoperability of the digital assets (data and services) and guarantee that data-driven science preserves provenance, lineage and quality information for its replicability. Historical data are frequently not compliant with these criteria, lacking metadata information that was not retained, crucial at the time of data generation and further ingestion into marine data infrastructures. The present data review is an example attempt to fill this gap through a thorough data reprocessing starting from the original raw data and operational log sheets. The data gathered using XBT (eXpendable BathyThermograph) probes during several monitoring activities in the Tyrrhenian and Ligurian seas between 1999 and 2019 have first been formatted and standardized according to the latest community best practices and all available metadata have been inserted, including calibration information never applied, uncertainty specification and bias correction from Cheng et al. (2014). Secondly, a new automatic quality control (QC) procedure has been developed and a new interpolation scheme applied. The reprocessed (REP) dataset has been compared to the data version, presently available from the SeaDataNet (SDN) data access portal, processed according to the pioneering work of Manzella et al. (2003) conducted in the framework of the European Union Mediterranean Forecasting System Pilot Project (Pinardi et al., 2003). The comparison between REP and SDN datasets has the objective to highlight the main differences derived from the new data processing process. The maximum discrepancy among the REP and SDN data versions always resides within the surface layer (REP profiles are warmer than SDN ones) until 150 m depth generally when the thermocline settles (from June to November). The overall bias and root mean square difference are equal to 0.002 and 0.041 °C, respectively. Such differences are mainly due to the new interpolation technique (Barker and McDougall, 2020) and the application of the calibration correction in the REP dataset. The REP dataset (Reseghetti et al., 2024; https://doi.org/10.13127/rep_xbt_1999_2019.2) is available and accessible through the INGV (Istituto Nazionale di Geofisica e Vulcanologia, Bologna) ERDDAP (Environmental Research Division's Data Access Program) server, which allows for machine-to-machine data access in compliance with the FAIR (findable, accessible, interoperable and reusable) principles (Wilkinson et al., 2016).