Nature Communications (Nov 2022)

Mapping global dynamics of benchmark creation and saturation in artificial intelligence

  • Simon Ott,
  • Adriano Barbosa-Silva,
  • Kathrin Blagec,
  • Jan Brauner,
  • Matthias Samwald

DOI
https://doi.org/10.1038/s41467-022-34591-0
Journal volume & issue
Vol. 13, no. 1
pp. 1 – 11

Abstract

Read online

Recent studies raised concerns over the state of AI benchmarking, reporting issues such as benchmark overfitting, benchmark saturation and increasing centralization of benchmark dataset creation. To facilitate monitoring of the health of the AI benchmarking ecosystem, the authors introduce methodologies for creating condensed maps of the global dynamics of benchmark.