PLoS Neglected Tropical Diseases (Feb 2023)
Data-driven predictions of potential Leishmania vectors in the Americas.
Abstract
The incidence of vector-borne diseases is rising as deforestation, climate change, and globalization bring humans in contact with arthropods that can transmit pathogens. In particular, incidence of American Cutaneous Leishmaniasis (ACL), a disease caused by parasites transmitted by sandflies, is increasing as previously intact habitats are cleared for agriculture and urban areas, potentially bringing people into contact with vectors and reservoir hosts. Previous evidence has identified dozens of sandfly species that have been infected with and/or transmit Leishmania parasites. However, there is an incomplete understanding of which sandfly species transmit the parasite, complicating efforts to limit disease spread. Here, we apply machine learning models (boosted regression trees) to leverage biological and geographical traits of known sandfly vectors to predict potential vectors. Additionally, we generate trait profiles of confirmed vectors and identify important factors in transmission. Our model performed well with an average out of sample accuracy of 86%. The models predict that synanthropic sandflies living in areas with greater canopy height, less human modification, and within an optimal range of rainfall are more likely to be Leishmania vectors. We also observed that generalist sandflies that are able to inhabit many different ecoregions are more likely to transmit the parasites. Our results suggest that Psychodopygus amazonensis and Nyssomia antunesi are unidentified potential vectors, and should be the focus of sampling and research efforts. Overall, we found that our machine learning approach provides valuable information for Leishmania surveillance and management in an otherwise complex and data sparse system.