Mathematics (Jan 2023)
Bisecting for Selecting: Using a Laplacian Eigenmaps Clustering Approach to Create the New European Football Super League
Abstract
Ranking sports teams generally relies on supervised techniques, requiring either prior knowledge or arbitrary metrics. In this paper, we offer a purely unsupervised technique. We apply this to operational decision-making, specifically, the controversial European Super League for association football, demonstrating how this approach can select dominant teams to form the new league. We first use random forest regression to select important variables predicting goal difference, which we use to calculate the Euclidian distances between teams. Creating a Laplacian eigenmap, we bisect the Fiedler vector to identify the natural clusters in five major European football leagues. Our results show how an unsupervised approach could identify four clusters based on five basic performance metrics: shots, shots on target, shots conceded, possession, and pass success. The top two clusters identify teams that dominate their respective leagues and are the best candidates to create the most competitive elite super league.
Keywords