Data in Brief (Dec 2024)
E-Staining DermaRepo: H&E whole slide image staining dataset
Mendeley Data
Abstract
Computer-aided diagnostic frameworks built on artificial intelligence and machine learning are data-hungry: they require large amounts of annotated data to automate disease diagnosis. Automating the steps that precede diagnosis, such as staining, can further improve the timeliness and accuracy of the pipeline. We provide a whole slide image repository comprising unstained skin biopsy images acquired with a brightfield microscope, together with chemically and virtually Hematoxylin and Eosin (H&E) stained image samples, to support virtualizing the staining procedure and improving the efficiency of the disease diagnosis pipeline. The dataset was used to train a Dual Contrastive GAN to generate virtually stained image samples. The trained model achieved a Fréchet Inception Distance (FID) of 80.47 between virtually stained and chemically stained image samples; since a lower FID indicates closer image distributions, this suggests that the synthesized images closely resemble the chemically stained originals. In contrast, FID scores of 342.01 and 320.40 were observed between unstained and virtually stained images, and between unstained and chemically stained images, respectively, indicating much lower similarity.
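The FID scores reported above compare two sets of images through the mean and covariance of their feature vectors. The abstract does not say how the scores were computed, so the following is only a minimal sketch of the standard Fréchet distance formula, assuming the feature vectors have already been extracted (FID conventionally uses Inception-v3 features, which this sketch does not compute). The function name `frechet_distance` and the random stand-in data are hypothetical and are not part of the dataset or the authors' code.

```python
import numpy as np


def frechet_distance(feats_a: np.ndarray, feats_b: np.ndarray) -> float:
    """Frechet distance between Gaussians fitted to two feature sets.

    feats_a, feats_b: arrays of shape (n_samples, n_features).
    Hypothetical helper; feature extraction is assumed to happen elsewhere.
    """
    mu_a, mu_b = feats_a.mean(axis=0), feats_b.mean(axis=0)
    sigma_a = np.atleast_2d(np.cov(feats_a, rowvar=False))
    sigma_b = np.atleast_2d(np.cov(feats_b, rowvar=False))

    # Tr((sigma_a sigma_b)^(1/2)) equals the sum of the square roots of the
    # eigenvalues of sigma_a @ sigma_b. For positive semi-definite inputs these
    # eigenvalues are real and non-negative up to numerical noise, hence the clip.
    eigvals = np.linalg.eigvals(sigma_a @ sigma_b)
    trace_sqrt = np.sqrt(np.clip(eigvals.real, 0.0, None)).sum()

    diff = mu_a - mu_b
    return float(diff @ diff + np.trace(sigma_a) + np.trace(sigma_b) - 2.0 * trace_sqrt)


# Random stand-in features (assumption: real use would pass network embeddings).
rng = np.random.default_rng(0)
reference = rng.normal(size=(500, 8))
similar = rng.normal(size=(500, 8))            # same distribution as reference
shifted = rng.normal(loc=3.0, size=(500, 8))   # mean shifted away from reference

print("similar vs reference:", frechet_distance(similar, reference))
print("shifted vs reference:", frechet_distance(shifted, reference))
```

In this sketch the shifted set yields a much larger distance than the similar set, which mirrors how the reported scores are read: the lower value for the virtually versus chemically stained pair indicates closer feature distributions than either comparison against unstained images.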