Foundations of Computing and Decision Sciences (Sep 2024)

On feature extraction using distances from reference points

  • Piernik Maciej,
  • Morzy Tadeusz,
  • Susmaga Robert,
  • Szczęch Izabela

DOI
https://doi.org/10.2478/fcds-2024-0015
Journal volume & issue
Vol. 49, no. 3
pp. 287 – 302

Abstract

Read online

Feature extraction is the key to a successfully trained classifier. Although many automatic methods exist for traditional data, other data types (e.g., sequences, graphs) usually require dedicated approaches. In this paper, we study a universal feature extraction method based on distance from reference points. First, we formalize this process and provide an instantiation based on network centrality. To reliably select the best reference points, we introduce the notion of θ-neighborhood which allows us to navigate the topography of fully connected graphs. Our experiments show that the proposed peak selection method is significantly better than a traditional top-k approach for centrality-based reference points and that the quality of the reference points is much less important than their quantity. Finally, we provide an alternative, neural network interpretation of reference points, which paves a path to optimization-based selection methods, together with a new type of neuron, called the Euclidean neuron, and the necessary modifications to backpropagation.

Keywords