Deep learning approaches to landmark detection in tsetse wing images.

Dylan S Geldenhuys; Shane Josias; Willie Brink; Mulanga Makhubele; Cang Hui; Pietro Landi; Jeremy Bingham; John Hargrove; Marijn C Hazelbag

doi:10.1371/journal.pcbi.1011194

PLoS Computational Biology (Jun 2023)

DOI: https://doi.org/10.1371/journal.pcbi.1011194
Journal volume & issue: Vol. 19, no. 6, p. e1011194

Abstract

Morphometric analysis of wings has been suggested for identifying and controlling isolated populations of tsetse (Glossina spp.), vectors of human and animal trypanosomiasis in Africa. Single-wing images were captured from an extensive data set of field-collected tsetse wings of the species Glossina pallidipes and G. m. morsitans. Morphometric analysis required locating 11 anatomical landmarks on each wing. Locating landmarks manually is time-consuming, prone to error, and infeasible for large data sets. We developed a two-tier method using deep learning architectures to classify images and make accurate landmark predictions. The first tier used a classification convolutional neural network to remove most wings that were missing landmarks. The second tier provided landmark coordinates for the remaining wings. For the second tier, we compared direct coordinate regression using a convolutional neural network with segmentation using a fully convolutional network. We evaluated shape bias in the resulting landmark predictions using Procrustes analysis, and we paid particular attention to consistent labelling to improve model performance. For an image size of 1024 × 1280, data augmentation reduced the mean pixel distance error of the regression model from 8.3 (95% confidence interval [4.4,10.3]) to 5.34 (95% confidence interval [3.0,7.0]). For the segmentation model, data augmentation did not alter the mean pixel distance error of 3.43 (95% confidence interval [1.9,4.4]). Segmentation had a higher computational complexity and produced some large outliers. Both models showed minimal shape bias. We deployed the regression model on the complete unannotated data set of 14,354 pairs of wing images, since this model had a lower computational cost and more stable predictions than the segmentation model. The resulting landmark data set is provided for future morphometric analysis. The methods we have developed could provide a starting point for studying the wings of other insect species. All the code used in this study was written in Python and has been open-sourced.
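
The two evaluation quantities named in the abstract, mean pixel distance error and Procrustes-based shape comparison, can be illustrated with a minimal Python sketch. This is not the authors' code: the function names are hypothetical, the pixel error is assumed to be the mean Euclidean distance between predicted and annotated landmarks, and the shape comparison is assumed to be an ordinary two-shape Procrustes alignment (translation, scaling to unit centroid size, rotation, no reflection) rather than whatever variant the study used.

```python
import math


def mean_pixel_distance(predicted, annotated):
    """Mean Euclidean distance (in pixels) between matching landmarks.

    Assumption: both arguments are equal-length lists of (x, y) tuples.
    """
    if len(predicted) != len(annotated):
        raise ValueError("landmark lists must have the same length")
    distances = [math.dist(p, a) for p, a in zip(predicted, annotated)]
    return sum(distances) / len(distances)


def _centre_and_scale(points):
    """Translate a shape to its centroid and scale it to unit centroid size."""
    n = len(points)
    cx = sum(x for x, _ in points) / n
    cy = sum(y for _, y in points) / n
    centred = [(x - cx, y - cy) for x, y in points]
    size = math.sqrt(sum(x * x + y * y for x, y in centred))
    return [(x / size, y / size) for x, y in centred]


def procrustes_distance_2d(shape_a, shape_b):
    """Residual distance after optimally rotating shape_b onto shape_a.

    Both shapes are first centred and scaled, so the result ignores
    position, size, and orientation and reflects shape difference only.
    """
    a = _centre_and_scale(shape_a)
    b = _centre_and_scale(shape_b)
    # Closed-form least-squares rotation angle for 2D point sets.
    num = sum(bx * ay - by * ax for (ax, ay), (bx, by) in zip(a, b))
    den = sum(bx * ax + by * ay for (ax, ay), (bx, by) in zip(a, b))
    theta = math.atan2(num, den)
    c, s = math.cos(theta), math.sin(theta)
    rotated = [(c * bx - s * by, s * bx + c * by) for bx, by in b]
    squared = sum(
        (ax - rx) ** 2 + (ay - ry) ** 2
        for (ax, ay), (rx, ry) in zip(a, rotated)
    )
    return math.sqrt(squared)


if __name__ == "__main__":
    # Toy landmark sets (not data from the study).
    truth = [(0.0, 0.0), (4.0, 0.0), (4.0, 2.0), (0.0, 2.0)]
    # The same shape rotated by 90 degrees, doubled in size, and shifted.
    moved = [(10.0, 5.0), (10.0, 13.0), (6.0, 13.0), (6.0, 5.0)]
    print(mean_pixel_distance(truth, moved))
    print(procrustes_distance_2d(truth, moved))
```

In this toy example the raw pixel error is large because the second shape has been moved and resized, while the Procrustes distance is close to zero because the underlying shape is unchanged. That distinction is why a shape-bias check is informative alongside a pixel-distance error.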

Published in PLoS Computational Biology

ISSN: 1553-734X (Print); 1553-7358 (Online)
Publisher: Public Library of Science (PLoS)
Country of publisher: United States
LCC subjects: Science: Biology (General)
Website: https://journals.plos.org/ploscompbiol/
