Scientific Reports (Jul 2024)

The role of morphometric characteristics in predicting 20-meter sprint performance through machine learning

  • Ahmet Kurtoğlu,
  • Özgür Eken,
  • Rukiye Çiftçi,
  • Bekir Çar,
  • Emrah Dönmez,
  • Serhat Kılıçarslan,
  • Mona M. Jamjoom,
  • Nagwan Abdel Samee,
  • Dina S. M. Hassan,
  • Noha F. Mahmoud

DOI
https://doi.org/10.1038/s41598-024-67405-y
Journal volume & issue
Vol. 14, no. 1
pp. 1 – 15

Abstract

Read online

Abstract The aim of this study was to test the morphometric features affecting 20-m sprint performance in children at the first level of primary education using machine learning (ML) algorithms. In this study, 130 male and 152 female volunteers aged between 6 and 11 years were included. After obtaining demographic information of the participants, skinfold thickness, diameter and circumference measurements, and 20-m sprint performance were determined. The study conducted three distinct experiments to determine the optimal ML technique for predicting outcomes. Initially, the entire feature space was utilized for training the ML models to establish a baseline performance. In the second experiment, only significant features identified through correlation analysis were used for training and testing the models, enhancing the focus on relevant predictors. Lastly, Principal Component Analysis (PCA) was employed to reduce the feature space, aiming to streamline model complexity while retaining data variance. These experiments collectively aimed to evaluate different feature selection and dimensionality reduction techniques, providing insights into the most effective strategies for optimizing predictive performance in the given context. The correlation-based selected features (Age, Height, waist circumference, hip circumference, leg length, thigh length, foot length) has produced a minimum Mean Squared Error (MSE) value of 0.012 for predicting the sprint performance in children. The effective utilization of correlation analysis in the selection of relevant features for our regression model suggests that the features selected exhibit robust linear associations with the target variable and can be relied upon as predictors.

Keywords