Data Science Journal (Jan 2012)

H-Metric: Characterizing Image Datasets via Homogenization Based on KNN-Queries

  • Welington M da Silva,
  • Jose F Rodrigues Jr,
  • Agma J M Traina,
  • Sergio F da Silva

DOI
https://doi.org/10.2481/dsj.10-007
Journal volume & issue
Vol. 10
pp. 52 – 60

Abstract

Read online

Precision-Recall is one of the main metrics for evaluating content-based image retrieval techniques. However, it does not provide an ample perception of the properties of an image dataset immersed in a metric space. In this work, we describe an alternative metric named H-Metric, which is determined along a sequence of controlled modifications in the image dataset. The process is named homogenization and works by altering the homogeneity characteristics of the classes of the images. The result is a process that measures how hard it is to deal with a set of images in respect to content-based retrieval, offering support in the task of analyzing configurations of distance functions and of features extractors.

Keywords