Xiehe Yixue Zazhi (Jan 2023)

Fungal Microscopic Image Classification Based on Multi-scale Attention Mechanism

  • ZHANG Xueyuan,
  • XU Hongyan,
  • DONG Yueming,
  • LIU Danfeng,
  • SUN Pengrui,
  • YAN Rui,
  • CUI Hongliang,
  • LEI Hong,
  • REN Fei

DOI
https://doi.org/10.12290/xhyxzz.2022-0169
Journal volume & issue
Vol. 14, no. 1
pp. 139 – 147

Abstract

Read online

Objective To establish a fungal image-assisted classification model using deep learning technology. Methods The microscope images of people infected with Aspergillus, Saccharomyces and Cryptococcus neoformans were retrospectively collected from the Eighth Medical Center of PLA General Hospital from September 2020 to April 2021. The images were randomly divided into training set, validation set and test set according to the ratio of 7∶1.5∶1.5. The improved MobileNetV2 network structure was trained using the training set, a convolutional neural network (CNN) fungal image 11 classification model based on multi-scale attention mechanism was constructed and the parameters were debugged based on the validation set. Machine identification results were taken as the gold standard, the performance of the model on 11 fungal image classification tasks was evaluated, and the results were shown by precision, recall and F1 value. In addition, the performance of the proposed model with 5 classic CNN models were compared, and the results were measured in terms of model parameters, memory usage, frames per second (FPS), accuracy, and area under the curve (AUC) of receiver operating characteristic curve. Results A total of 7666 fungal microscope images were collected, including 2781, 4115, and 770 images of Aspergillus, Saccharomyces, and Cryptococcus neoformans, respectively. Among them, there were 5366 training images, 1150 validation images, and 1150 test images. The improved MobileNetV2 model had high performance for the classification of 11 fungal images in the test set. The precision rate was distributed between 96.36% and 100%, the recall rate was distributed between 96.53% and 100%, and the F1 value was distributed between 97.01% and 100%. The parameters, memory usage, FPS, accuracy, and AUC of the improved MobileNetV2 model were 4.22 M, 356.89 M, 573, (99.09±0.18)%, and 0.9944±0.0018, respectively, and the comprehensive performance was better than 5 kinds of classical networks. Conclusion The proposed fungal image classification model based on the improved MobileNetV2 can obtain higher fungal image recognition ability while maintaining low computational cost, with an overall performance better than classical CNN model.

Keywords