Scientific Data (Aug 2023)

A large public dataset of annotated clinical MRIs and metadata of patients with acute stroke

  • Chin-Fu Liu,
  • Richard Leigh,
  • Brenda Johnson,
  • Victor Urrutia,
  • Johnny Hsu,
  • Xin Xu,
  • Xin Li,
  • Susumu Mori,
  • Argye E. Hillis,
  • Andreia V. Faria

DOI
https://doi.org/10.1038/s41597-023-02457-9
Journal volume & issue
Vol. 10, no. 1
pp. 1 – 14

Abstract

Read online

Abstract To extract meaningful and reproducible models of brain function from stroke images, for both clinical and research proposes, is a daunting task severely hindered by the great variability of lesion frequency and patterns. Large datasets are therefore imperative, as well as fully automated image post-processing tools to analyze them. The development of such tools, particularly with artificial intelligence, is highly dependent on the availability of large datasets to model training and testing. We present a public dataset of 2,888 multimodal clinical MRIs of patients with acute and early subacute stroke, with manual lesion segmentation, and metadata. The dataset provides high quality, large scale, human-supervised knowledge to feed artificial intelligence models and enable further development of tools to automate several tasks that currently rely on human labor, such as lesion segmentation, labeling, calculation of disease-relevant scores, and lesion-based studies relating function to frequency lesion maps.