Frontiers in Zoology (Apr 2024)

Fine-grained image classification on bats using VGG16-CBAM: a practical example with 7 horseshoe bats taxa (CHIROPTERA: Rhinolophidae: Rhinolophus) from Southern China

  • Zhong Cao,
  • Kunhui Wang,
  • Jiawei Wen,
  • Chuxian Li,
  • Yi Wu,
  • Xiaoyun Wang,
  • Wenhua Yu

DOI
https://doi.org/10.1186/s12983-024-00531-5
Journal volume & issue
Vol. 21, no. 1
pp. 1 – 13

Abstract

Read online

Abstract Background Rapid identification and classification of bats are critical for practical applications. However, species identification of bats is a typically detrimental and time-consuming manual task that depends on taxonomists and well-trained experts. Deep Convolutional Neural Networks (DCNNs) provide a practical approach for the extraction of the visual features and classification of objects, with potential application for bat classification. Results In this study, we investigated the capability of deep learning models to classify 7 horseshoe bat taxa (CHIROPTERA: Rhinolophus) from Southern China. We constructed an image dataset of 879 front, oblique, and lateral targeted facial images of live individuals collected during surveys between 2012 and 2021. All images were taken using a standard photograph protocol and setting aimed at enhancing the effectiveness of the DCNNs classification. The results demonstrated that our customized VGG16-CBAM model achieved up to 92.15% classification accuracy with better performance than other mainstream models. Furthermore, the Grad-CAM visualization reveals that the model pays more attention to the taxonomic key regions in the decision-making process, and these regions are often preferred by bat taxonomists for the classification of horseshoe bats, corroborating the validity of our methods. Conclusion Our finding will inspire further research on image-based automatic classification of chiropteran species for early detection and potential application in taxonomy.

Keywords