Royal Society Open Science (Sep 2023)
Principal component and linear discriminant analyses for the classification of hominoid primate specimens based on bone shape data
Abstract
In this study, we tested the hypothesis that machine learning methods can accurately classify extant primates based on triquetrum shape data. We then used this classification tool to observe the affinities between extant primates and fossil hominoids. We assessed the discrimination accuracy for an unsupervised and supervised learning pipeline, i.e. with principal component analysis (PCA) and linear discriminant analysis (LDA) feature extraction, when tasked with the classification of extant primates. The trained algorithm is used to classify a sample of known fossil hominoids. For the visualization, PCA and uniform manifold approximation and projection (UMAP) are used. The results show that the discriminant function correctly classified the extant specimens with an F1-score of 0.90 for both PCA and LDA. In addition, the classification of fossil hominoids reflects taxonomy and locomotor behaviour reported in literature. This classification based on shape data using PCA and LDA is a powerful tool that can discriminate between the triquetrum shape of extant primates with high accuracy and quantitatively compare fossil and extant morphology. It can be used to support taxonomic differentiation and aid the further interpretation of fossil remains. Further testing is necessary by including other bones and more species and specimens per species extinct primates.
Keywords