Scientific Reports (Mar 2024)

Social and economic variables explain COVID-19 diffusion in European regions

  • Christian Cancedda,
  • Alessio Cappellato,
  • Luigi Maninchedda,
  • Leonardo Meacci,
  • Sofia Peracchi,
  • Claudia Salerni,
  • Elena Baralis,
  • Flavio Giobergia,
  • Stefano Ceri

DOI
https://doi.org/10.1038/s41598-024-56267-z
Journal volume & issue
Vol. 14, no. 1
pp. 1 – 13

Abstract

Read online

Abstract At the beginning of 2020, Italy was the country with the highest number of COVID-19 cases, not only in Europe, but also in the rest of the world, and Lombardy was the most heavily hit region of Italy. The objective of this research is to understand which variables have determined the prevalence of cases in Lombardy and in other highly-affected European regions. We consider the first and second waves of the COVID-19 pandemic, using a set of 22 variables related to economy, population, healthcare and education. Regions with a high prevalence of cases are extracted by means of binary classifiers, then the most relevant variables for the classification are determined, and the robustness of the analysis is assessed. Our results show that the most meaningful features to identify high-prevalence regions include high number of hours spent in work environments, high life expectancy, and low number of people leaving from education and neither employed nor educated or trained.