Early prediction of diagnostic-related groups and estimation of hospital cost by processing clinical notes

Jinghui Liu; Daniel Capurro; Anthony Nguyen; Karin Verspoor

doi:10.1038/s41746-021-00474-9

npj Digital Medicine (Jul 2021)

Early prediction of diagnostic-related groups and estimation of hospital cost by processing clinical notes

Jinghui Liu,
Daniel Capurro,
Anthony Nguyen,
Karin Verspoor

Affiliations

Jinghui Liu: School of Computing and Information Systems, The University of Melbourne
Daniel Capurro: School of Computing and Information Systems, The University of Melbourne
Anthony Nguyen: Australian e-Health Research Centre, CSIRO
Karin Verspoor: School of Computing and Information Systems, The University of Melbourne

DOI: https://doi.org/10.1038/s41746-021-00474-9
Journal volume & issue: Vol. 4, no. 1
pp. 1 – 8

Abstract

Read online

Abstract As healthcare providers receive fixed amounts of reimbursement for given services under DRG (Diagnosis-Related Groups) payment, DRG codes are valuable for cost monitoring and resource allocation. However, coding is typically performed retrospectively post-discharge. We seek to predict DRGs and DRG-based case mix index (CMI) at early inpatient admission using routine clinical text to estimate hospital cost in an acute setting. We examined a deep learning-based natural language processing (NLP) model to automatically predict per-episode DRGs and corresponding cost-reflecting weights on two cohorts (paid under Medicare Severity (MS) DRG or All Patient Refined (APR) DRG), without human coding efforts. It achieved macro-averaged area under the receiver operating characteristic curve (AUC) scores of 0·871 (SD 0·011) on MS-DRG and 0·884 (0·003) on APR-DRG in fivefold cross-validation experiments on the first day of ICU admission. When extended to simulated patient populations to estimate average cost-reflecting weights, the model increased its accuracy over time and obtained absolute CMI error of 2·40 (1·07%) and 12·79% (2·31%), respectively on the first day. As the model could adapt to variations in admission time, cohort size, and requires no extra manual coding efforts, it shows potential to help estimating costs for active patients to support better operational decision-making in hospitals.

Published in npj Digital Medicine

ISSN: 2398-6352 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics
Website: https://www.nature.com/npjdigitalmed/

About the journal