Remote Sensing (May 2018)

A Statistical Modeling Framework for Characterising Uncertainty in Large Datasets: Application to Ocean Colour

  • Peter E. Land,
  • Trevor C. Bailey,
  • Malcolm Taberner,
  • Silvia Pardo,
  • Shubha Sathyendranath,
  • Kayvan Nejabati Zenouz,
  • Vicki Brammall,
  • Jamie D. Shutler,
  • Graham D. Quartly

DOI
https://doi.org/10.3390/rs10050695
Journal volume & issue
Vol. 10, no. 5
p. 695

Abstract

Read online

Uncertainty estimation is crucial to establishing confidence in any data analysis, and this is especially true for Essential Climate Variables, including ocean colour. Methods for deriving uncertainty vary greatly across data types, so a generic statistics-based approach applicable to multiple data types is an advantage to simplify the use and understanding of uncertainty data. Progress towards rigorous uncertainty analysis of ocean colour has been slow, in part because of the complexity of ocean colour processing. Here, we present a general approach to uncertainty characterisation, using a database of satellite-in situ matchups to generate a statistical model of satellite uncertainty as a function of its contributing variables. With an example NASA MODIS-Aqua chlorophyll-a matchups database mostly covering the north Atlantic, we demonstrate a model that explains 67% of the squared error in log(chlorophyll-a) as a potentially correctable bias, with the remaining uncertainty being characterised as standard deviation and standard error at each pixel. The method is quite general, depending only on the existence of a suitable database of matchups or reference values, and can be applied to other sensors and data types such as other satellite observed Essential Climate Variables, empirical algorithms derived from in situ data, or even model data.

Keywords