Integrative Medicine Research (Mar 2025)

Methods for identifying health status from routinely collected health data: An overview

  • Mei Liu,
  • Ke Deng,
  • Mingqi Wang,
  • Qiao He,
  • Jiayue Xu,
  • Guowei Li,
  • Kang Zou,
  • Xin Sun,
  • Wen Wang

Journal volume & issue
Vol. 14, no. 1
p. 101100

Abstract

Read online

Routinely collected health data (RCD) are currently accelerating publications that evaluate the effectiveness and safety of medicines and medical devices. One of the fundamental steps in using these data is developing algorithms to identify health status that can be used for observational studies. However, the process and methodologies for identifying health status from RCD remain insufficiently understood. While most current methods rely on International Classification of Diseases (ICD) codes, they may not be universally applicable. Although machine learning methods hold promise for more accurately identifying the health status, they remain underutilized in RCD studies. To address these significant methodological gaps, we outline key steps and methodological considerations for identifying health statuses in observational studies using RCD. This review has the potential to boost the credibility of findings from observational studies that use RCD.

Keywords