Scientific Data (Nov 2024)

MIMIC-BP: A curated dataset for blood pressure estimation

  • Ivandro Sanches,
  • Victor V. Gomes,
  • Carlos Caetano,
  • Lizeth S. B. Cabrera,
  • Vinicius H. Cene,
  • Thomas Beltrame,
  • Wonkyu Lee,
  • Sanghyun Baek,
  • Otávio A. B. Penatti

DOI
https://doi.org/10.1038/s41597-024-04041-1
Journal volume & issue
Vol. 11, no. 1
pp. 1 – 11

Abstract

Read online

Abstract Blood pressure (BP) is one of the most prominent indicators of potential cardiovascular disorders. Traditionally, BP measurement relies on inflatable cuffs, which is inconvenient and limit the acquisition of such important health-related information in general population. Based on large amounts of well-collected and annotated data, deep-learning approaches present a generalization potential that arose as an alternative to enable more pervasive approaches. However, most existing work in this area currently uses datasets with limitations, such as lack of subject identification and severe data imbalance that can result in data leakage and algorithm bias. Thus, to offer a more properly curated source of information, we propose a derivative dataset composed of 380 hours of the most common biomedical signals, including arterial blood pressure, photoplethysmography, and electrocardiogram for 1,524 anonymized subjects, each having 30 segments of 30 seconds of those signals. We also validated the proposed dataset through experiments using state-of-the-art deep-learning methods, as we highlight the importance of standardized benchmarks for calibration-free blood pressure estimation scenarios.