Methods in Ecology and Evolution (Feb 2024)
An open‐source general purpose machine learning framework for individual animal re‐identification using few‐shot learning
Abstract
Abstract Animal re‐identification remains a challenging problem due to the cost of tagging systems and the difficulty of permanently attaching a physical marker to some animals, such as sea stars. Due to these challenges, photo identification is a good fit to solve this problem whether evaluated by humans or through machine learning. Accurate machine learning methods are an improvement over manual identification as they are capable of evaluating a large number of images automatically and recent advances have reduced the need for large training datasets. This study aimed to create an accurate, robust, general purpose machine learning framework for individual animal re‐identification using images both from publicly available data as well as two groups of sea stars of different species under human care. Open‐source code was provided to accelerate work in this space. Images of two species of sea star (Asterias rubens and Anthenea australiae) were taken using a consumer‐grade smartphone camera and used as original datasets to train a machine learning model to re‐identify an individual animal using few examples. The model's performance was evaluated on these original sea star datasets which contained between 39–54 individuals and 983–1204 images, as well as using six publicly available re‐identification datasets for tigers, beef cattle noses, chimpanzee faces, zebras, giraffes and ringed seals ranging between 45–2056 individuals and 829–6770 images. Using time aware‐splits, which are a data splitting technique ensuring that the model only sees an individual's images from a previous collection event during training to avoid information leaking, the model achieved high (>99%) individual re‐identification mean average precision for the top prediction (mAP@1) for the two species of sea stars. The re‐identification mAP@1 for the mammalian datasets was more variable, ranging from 83% to >99%. However, this model outperformed published state‐of‐the‐art re‐identification results for the publicly available datasets. The reported approach for animal re‐identification is generalizable, with the same machine learning framework achieving good performance in two distinct species of sea stars with different physical attributes, as well as seven different mammalian species. This demonstrates that this methodology can be applied to nearly any species where individual re‐identification is required. This study presents a precise, practical, non‐invasive approach to animal re‐identification using only basic image collection methods.
Keywords