A distance-based kernel for classification via Support Vector Machines

Nazhir Amaya-Tejera; Margarita Gamarra; Jorge I. Vélez; Eduardo Zurek

doi:10.3389/frai.2024.1287875

Frontiers in Artificial Intelligence (Feb 2024)

A distance-based kernel for classification via Support Vector Machines

Nazhir Amaya-Tejera,
Margarita Gamarra,
Jorge I. Vélez,
Eduardo Zurek

Affiliations

Nazhir Amaya-Tejera: Department of Computer Science, Universidad del Norte, Barranquilla, Colombia
Margarita Gamarra: Department of Computer Science, Universidad del Norte, Barranquilla, Colombia
Jorge I. Vélez: Department of Industrial Engineering, Universidad del Norte, Barranquilla, Colombia
Eduardo Zurek: Department of Computer Science, Universidad del Norte, Barranquilla, Colombia

DOI: https://doi.org/10.3389/frai.2024.1287875
Journal volume & issue: Vol. 7

Abstract

Read online

Support Vector Machines (SVMs) are a type of supervised machine learning algorithm widely used for classification tasks. In contrast to traditional methods that split the data into separate training and testing sets, here we propose an innovative approach where subsets of the original data are randomly selected to train the model multiple times. This iterative training process aims to identify a representative data subset, leading to improved inferences about the population. Additionally, we introduce a novel distance-based kernel specifically designed for binary-type features based on a similarity matrix that efficiently handles both binary and multi-class classification problems. Computational experiments on publicly available datasets of varying sizes demonstrate that our proposed method significantly outperforms existing approaches in terms of classification accuracy. Furthermore, the distance-based kernel achieves superior performance compared to other well-known kernels from the literature and those used in previous studies on the same datasets. These findings validate the effectiveness of our proposed classification method and distance-based kernel for SVMs. By leveraging random subset selection and a unique kernel design, we achieve notable improvements in classification accuracy. These results have significant implications for diverse classification problems in Machine Learning and data analysis.

Published in Frontiers in Artificial Intelligence

ISSN: 2624-8212 (Online)
Publisher: Frontiers Media S.A.
Country of publisher: Switzerland
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://www.frontiersin.org/journals/artificial-intelligence#

About the journal

Abstract

Keywords