Scientific Data (Jul 2024)

Pixel-wise segmentation of cells in digitized Pap smear images

  • Balazs Harangi,
  • Gergo Bogacsovics,
  • Janos Toth,
  • Ilona Kovacs,
  • Erzsebet Dani,
  • Andras Hajdu

DOI
https://doi.org/10.1038/s41597-024-03566-9
Journal volume & issue
Vol. 11, no. 1
pp. 1 – 8

Abstract

Read online

Abstract A simple and cheap way to recognize cervical cancer is using light microscopic analysis of Pap smear images. Training artificial intelligence-based systems becomes possible in this domain, e.g., to follow the European recommendation to screen negative smears to reduce false negative cases. The first step for such a process is segmenting the cells. A large and manually segmented dataset is required for this task, which can be used to train deep learning-based solutions. We describe a corresponding dataset with accurate manual segmentations for the enclosed cells. Altogether, the APACS23 (Annotated PAp smear images for Cell Segmentation 2023) dataset contains about 37 000 manually segmented cells and is separated into dedicated training and test parts, which could be used for an official benchmark of scientific investigations or a grand challenge.