Digital Diagnostics (Dec 2020)

MosMedData: data set of 1110 chest CT scans performed during the COVID-19 epidemic

  • Sergey P. Morozov,
  • Anna E. Andreychenko,
  • Ivan A. Blokhin,
  • Pavel B. Gelezhe,
  • Anna P. Gonchar,
  • Alexander E. Nikolaev,
  • Nikolay A. Pavlov,
  • Valeria Yu. Chernina,
  • Victor A. Gombolevskiy

DOI
https://doi.org/10.17816/DD46826
Journal volume & issue
Vol. 1, no. 1
pp. 49 – 59

Abstract

Read online

With the ongoing COVID-19 pandemic decreasing availability of polymerase chain reaction with reverse transcription and the snowballing growth of medical imaging, especially the number of chest computed tomography (CT) scans being performed, methods to augment and automate the image analysis, increasing productivity and minimizing human error are of particular importance. The creation of high-quality datasets is essential for the development and validation of artificial intelligence algorithms. Such technologies have sufficient accuracy in diagnosing COVID-19 in medical imaging. The presented large-scale dataset contains anonymized human CT scans with COVID-19 features as well as normal studies. Some studies were tagged by radiologists using binary pixel masks of regions of interest (e.g., characteristic areas of consolidation and ground-glass opacities). CT data were acquired between March 1, 2020, and April 25, 2020, and provided by municipal hospitals in Moscow, Russia. The presented dataset is licensed under Creative Commons Attribution-NonCommercial-NoDerivs 3.0 Unported (CC BY-NC-ND 3.0).

Keywords