Aggregation strategies to improve XAI for geoscience models that use correlated, high-dimensional rasters

Evan Krell; Hamid Kamangir; Waylon Collins; Scott A. King; Philippe Tissot

doi:10.1017/eds.2023.39

Environmental Data Science (Jan 2023)

Aggregation strategies to improve XAI for geoscience models that use correlated, high-dimensional rasters

Evan Krell,
Hamid Kamangir,
Waylon Collins,
Scott A. King,
Philippe Tissot

Affiliations

Evan Krell: ORCiD; Department of Computer Science, Texas A&M University - Corpus Christi, Corpus Christi, Texas, USA Innovation in COmputer REsearch Lab (iCORE), Texas A&M University - Corpus Christi, Corpus Christi, Texas, USA Conrad Blucher Institute for Surveying and Science, Texas A&M University - Corpus Christi, Corpus Christi, Texas, USA NSF AI Institute for Research on Trustworthy AI in Weather, Climate and Coastal Oceanography
Hamid Kamangir: Conrad Blucher Institute for Surveying and Science, Texas A&M University - Corpus Christi, Corpus Christi, Texas, USA NSF AI Institute for Research on Trustworthy AI in Weather, Climate and Coastal Oceanography
Waylon Collins: National Weather Service, Corpus Christi, Texas, USA NSF AI Institute for Research on Trustworthy AI in Weather, Climate and Coastal Oceanography
Scott A. King: Department of Computer Science, Texas A&M University - Corpus Christi, Corpus Christi, Texas, USA Innovation in COmputer REsearch Lab (iCORE), Texas A&M University - Corpus Christi, Corpus Christi, Texas, USA NSF AI Institute for Research on Trustworthy AI in Weather, Climate and Coastal Oceanography
Philippe Tissot: Conrad Blucher Institute for Surveying and Science, Texas A&M University - Corpus Christi, Corpus Christi, Texas, USA NSF AI Institute for Research on Trustworthy AI in Weather, Climate and Coastal Oceanography

DOI: https://doi.org/10.1017/eds.2023.39
Journal volume & issue: Vol. 2

Abstract

Read online

Complex machine learning architectures and high-dimensional gridded input data are increasingly used to develop high-performance geoscience models, but model complexity obfuscates their decision-making strategies. Understanding the learned patterns is useful for model improvement or scientific investigation, motivating research in eXplainable artificial intelligence (XAI) methods. XAI methods often struggle to produce meaningful explanations of correlated features. Gridded geospatial data tends to have extensive autocorrelation so it is difficult to obtain meaningful explanations of geoscience models. A recommendation is to group correlated features and explain those groups. This is becoming common when using XAI to explain tabular data. Here, we demonstrate that XAI algorithms are highly sensitive to the choice of how we group raster elements. We demonstrate that reliance on a single partition scheme yields misleading explanations. We propose comparing explanations from multiple grouping schemes to extract more accurate insights from XAI. We argue that each grouping scheme probes the model in a different way so that each asks a different question of the model. By analyzing where the explanations agree and disagree, we can learn information about the scale of the learned features. FogNet, a complex three-dimensional convolutional neural network for coastal fog prediction, is used as a case study for investigating the influence of feature grouping schemes on XAI. Our results demonstrate that careful consideration of how each grouping scheme probes the model is key to extracting insights and avoiding misleading interpretations.

Published in Environmental Data Science

ISSN: 2634-4602 (Online)
Publisher: Cambridge University Press
Country of publisher: United Kingdom
LCC subjects: Geography. Anthropology. Recreation: Environmental sciences; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://www.cambridge.org/core/journals/environmental-data-science

About the journal

Abstract

Keywords