Frontiers in Ecology and Evolution (Sep 2022)
An ensemble learning approach to map the genetic connectivity of the parasitoid Stethynium empoasca (Hymenoptera: Mymaridae) and identify the key influencing environmental and landscape factors
Abstract
The effect of landscape patterns and environmental factors on the population structure and genetic diversity of organisms is well-documented. However, this effect is still unclear in the case of Mymaridae parasitoids. Despite recent advances in machine learning methods for landscape genetics, ensemble learning still needs further investigation. Here, we evaluated the performance of different boosting algorithms and analyzed the effects of landscape and environmental factors on the genetic variations in the tea green leafhopper parasitoid Stethynium empoasca (Hymenoptera: Mymaridae). The S. empoasca populations showed a distinct pattern of isolation by distance. The minimum temperature of the coldest month, annual precipitation, the coverage of evergreen/deciduous needleleaf trees per 1 km2, and the minimum precipitation of the warmest quarter were identified as the dominant factors affecting the genetic divergence of S. empoasca populations. Notably, compared to previous machine learning studies, our model showed an unprecedented accuracy (r = 0.87) for the prediction of genetic differentiation. These findings not only demonstrated how the landscape shaped S. empoasca genetics but also provided an essential basis for developing conservation strategies for this biocontrol agent. In a broader sense, this study demonstrated the importance and efficiency of ensemble learning in landscape genetics.
Keywords