Metric Ensembles Aid in Explainability: A Case Study with Wikipedia Data

Grant Forbes; R. Jordan Crouser

doi:10.3390/analytics2020017

Analytics (Apr 2023)

Metric Ensembles Aid in Explainability: A Case Study with Wikipedia Data

Grant Forbes,
R. Jordan Crouser

Affiliations

Grant Forbes: Department of Computer Science, North Carolina State University, Raleigh, NC 27695, USA
R. Jordan Crouser: Department of Computer Science, Smith College, Northampton, MA 01063, USA

DOI: https://doi.org/10.3390/analytics2020017
Journal volume & issue: Vol. 2, no. 2
pp. 315 – 327

Abstract

Read online

In recent years, as machine learning models have become larger and more complex, it has become both more difficult and more important to be able to explain and interpret the results of those models, both to prevent model errors and to inspire confidence for end users of the model. As such, there has been a significant and growing interest in explainability in recent years as a highly desirable trait for a model to have. Similarly, there has been much recent attention on ensemble methods, which aim to aggregate results from multiple (often simple) models or metrics in order to outperform models that optimize for only a single metric. We argue that this latter issue can actually assist with the former: a model that optimizes for several metrics has some base level of explainability baked into the model, and this explainability can be leveraged not only for user confidence but to fine-tune the weights between the metrics themselves in an intuitive way. We demonstrate a case study of such a benefit, in which we obtain clear, explainable results based on an aggregate of five simple metrics of relevance, using Wikipedia data as a proxy for some large text-based recommendation problem. We demonstrate that not only can these metrics’ simplicity and multiplicity be leveraged for explainability, but in fact, that very explainability can lead to an intuitive fine-tuning process that improves the model itself.

Published in Analytics

ISSN: 2813-2203 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science; Science: Mathematics: Probabilities. Mathematical statistics
Website: https://www.mdpi.com/journal/analytics

About the journal

Abstract

Keywords