Scientific Data (Jul 2025)

Large open access database of echocardiogram reports in intensive care unit patients

  • Gloria Hyunjung Kwak,
  • Dana Moukheiber,
  • Mira Moukheiber,
  • Lama Moukheiber,
  • Sulaiman Moukheiber,
  • Neel M. Butala,
  • Leo A. Celi,
  • Christina W. Chen

DOI
https://doi.org/10.1038/s41597-025-04849-5
Journal volume & issue
Vol. 12, no. 1
pp. 1 – 9

Abstract

Read online

Abstract The EchoNotes Structured Database derived from MIMIC-III (ECHO-NOTE2NUM) is a structured echocardiogram database derived from 43,472 observational notes obtained during echocardiogram studies conducted in the intensive care unit at the Beth Israel Deaconess Medical Center between 2001 and 2012. The database encompasses various aspects of cardiac structure and function, including cavity size, wall thickness, systolic and diastolic function, valve regurgitation and stenosis, as well as pulmonary pressures. To facilitate extensive data analysis, the clinical notes were transformed into a structured numerical format. Within each echocardiogram report sentence, specific words or phrases were identified to describe abnormal findings, and a severity staging system using numeric categories was established. This large publicly accessible database of structured echocardiogram data holds significant potential as a tool to investigate cardiovascular disease in the intensive care unit and as a reference point for future note-based structured databases. Moreover, its structured nature allows for easy integration with other data types in MIMIC, such as images or vital signs, enabling large-scale data analysis and further advancements in this field.