Acta Scientiarum: Agronomy (Oct 2023)

Image analysis of seeds and machine learning as a tool for distinguishing populations: Applied to an invasive tree species

  • Francival Cardoso Felix,
  • Kyvia Pontes Teixeira das Chagas,
  • Fernando dos Santos Araújo,
  • Josenilda Aprigio Dantas de Medeiros,
  • Fábio de Almeida Vieira,
  • Salvador Barros Torres,
  • Mauro Vasconcelos Pacheco

DOI
https://doi.org/10.4025/actasciagron.v46i1.62658
Journal volume & issue
Vol. 46, no. 1

Abstract

Read online

Invasive species threaten crops and ecosystems worldwide. Therefore, we sought to understand the relationship between the geographic distribution of species populations and the characteristics of seeds using new techniques such as seed image analysis, multivariate analysis, and machine learning. This study aimed to characterize Leucaena leucocephala (Lam.) de Wit. seeds from spatially dispersed populations using digital images and analyzed their implications for genetic studies. Seed size and shape descriptors were obtained using image analysis of the five populations. Several analyses were performed including descriptive statistics, principal components, Euclidean distance, Mantel correlation test, and supervised machine learning. This image analysis technique proved to be efficient in detecting biometric differences in L. leucocephala seeds from spatially dispersed populations. This method revealed that spatially dispersed L. leucocephala populations had different biometric seed patterns that can be used in studies of population genetic divergence. We observed that it is possible to identify the origin of the seeds from the biometric characters with 80.4% accuracy (Kappa statistic 0.755) when we applied the decision tree algorithm. Digital imaging analysis associated with machine learning is promising for discriminating forest tree populations, supporting management activities, and studying population genetic divergence. This technique contributes to the understanding of genotype-environment interactions and consequently identifies the ability of an invasive species to spread in a new area, making it possible to track and monitor the flow of seeds between populations and other sites.

Keywords