Heliyon (Feb 2024)
Increasing segmentation performance with synthetic agar plate images
Abstract
Background: Agar plate analysis is vital for microbiological testing in industries like food, pharmaceuticals, and biotechnology. Manual inspection is slow, laborious, and error-prone, while existing automated systems struggle with the complexity of real-world agar plates. A shortage of diverse datasets hinders the development and evaluation of robust automated systems. Methods: In this paper, two new annotated datasets and a novel methodology for synthetic agar plate generation are presented. The datasets comprise 854 images of cultivated agar plates and 1,588 images of empty agar plates, encompassing various agar plate types and microorganisms. These datasets are an extension of the publicly available BRUKERCOLONY dataset, collectively forming one of the largest publicly available annotated datasets for research. The methodology is based on an efficient image generation pipeline that also simulates cultivation-related phenomena such as haemolysis or chromogenic reactions. Results: The augmentations significantly improved the Dice coefficient of trained U-Net models, increasing it from 0.671 to 0.721. Furthermore, training the U-Net model with a combination of real and 150% synthetic data demonstrated its efficacy, yielding a remarkable Dice coefficient of 0.729, a substantial improvement from the baseline of 0.518. UNet3+ exhibited the highest performance among the U-Net and Attention U-Net architectures, achieving a Dice coefficient of 0.767. Conclusions: Our experiments showed the methodology's applicability to real-world scenarios, even with highly variable agar plates. Our paper contributes to automating agar plate analysis by presenting a new dataset and effective methodology, potentially enhancing fully automated microbiological testing.