Scientific Data (Feb 2023)

An Open Dataset of Annotated Metaphase Cell Images for Chromosome Identification

  • Jenn-Jhy Tseng,
  • Chien-Hsing Lu,
  • Jun-Zhou Li,
  • Hui-Yu Lai,
  • Min-Hu Chen,
  • Fu-Yuan Cheng,
  • Chih-En Kuo

DOI
https://doi.org/10.1038/s41597-023-02003-7
Journal volume & issue
Vol. 10, no. 1
pp. 1 – 8

Abstract

Chromosomes are a principal target of clinical cytogenetic studies. While chromosomal analysis is an integral part of prenatal care, the conventional manual identification of chromosomes in images is time-consuming and costly. This study developed a deep-learning chromosome detector that achieved an accuracy of 98.88% in chromosome identification. Specifically, we compiled and made publicly available a large database of chromosome images and annotations for training chromosome detectors. The database contains 5,000 24-class chromosome annotations and 2,000 single-chromosome annotations, and it also includes examples of chromosome variations. Our database provides a reference for researchers in this field and may help expedite the development of clinical applications.