Technical note: Inherent benchmark or not? Comparing Nash–Sutcliffe and Kling–Gupta efficiency scores

W. J. M. Knoben; W. J. M. Knoben; J. E. Freer; J. E. Freer; R. A. Woods; R. A. Woods

doi:10.5194/hess-23-4323-2019

Hydrology and Earth System Sciences (Oct 2019)

Technical note: Inherent benchmark or not? Comparing Nash–Sutcliffe and Kling–Gupta efficiency scores

W. J. M. Knoben,
W. J. M. Knoben,
J. E. Freer,
J. E. Freer,
R. A. Woods,
R. A. Woods

Affiliations

W. J. M. Knoben: Department of Civil Engineering, University of Bristol, Bristol, BS8 1TR, UK
W. J. M. Knoben: now at: University of Saskatchewan Coldwater Laboratory, Canmore, Alberta, Canada
J. E. Freer: School of Geographical Sciences, University of Bristol, Bristol, BS8 1BF, UK
J. E. Freer: Cabot Institute, University of Bristol, Bristol, BS8 1UJ, UK
R. A. Woods: Department of Civil Engineering, University of Bristol, Bristol, BS8 1TR, UK
R. A. Woods: Cabot Institute, University of Bristol, Bristol, BS8 1UJ, UK

DOI: https://doi.org/10.5194/hess-23-4323-2019
Journal volume & issue: Vol. 23
pp. 4323 – 4331

Abstract

Read online

A traditional metric used in hydrology to summarize model performance is the Nash–Sutcliffe efficiency (NSE). Increasingly an alternative metric, the Kling–Gupta efficiency (KGE), is used instead. When NSE is used, NSE = 0 corresponds to using the mean flow as a benchmark predictor. The same reasoning is applied in various studies that use KGE as a metric: negative KGE values are viewed as bad model performance, and only positive values are seen as good model performance. Here we show that using the mean flow as a predictor does not result in KGE = 0, but instead KGE =1-√2≈-0.41. Thus, KGE values greater than −0.41 indicate that a model improves upon the mean flow benchmark – even if the model's KGE value is negative. NSE and KGE values cannot be directly compared, because their relationship is non-unique and depends in part on the coefficient of variation of the observed time series. Therefore, modellers who use the KGE metric should not let their understanding of NSE values guide them in interpreting KGE values and instead develop new understanding based on the constitutive parts of the KGE metric and the explicit use of benchmark values to compare KGE scores against. More generally, a strong case can be made for moving away from ad hoc use of aggregated efficiency metrics and towards a framework based on purpose-dependent evaluation metrics and benchmarks that allows for more robust model adequacy assessment.

Published in Hydrology and Earth System Sciences

ISSN: 1027-5606 (Print); 1607-7938 (Online)
Publisher: Copernicus Publications
Country of publisher: Germany
LCC subjects: Technology: Environmental technology. Sanitary engineering; Geography. Anthropology. Recreation: Environmental sciences
Website: http://www.hydrology-and-earth-system-sciences.net/

About the journal