Journal of Health Economics and Outcomes Research (Sep 2024)

Artificial Intelligence in Identifying Patients With Undiagnosed Nonalcoholic Steatohepatitis

  • Onur Baser,
  • Gabriela Samayoa,
  • Nehir Yapar,
  • Erdem Baser

Journal volume & issue
Vol. 11, no. 2

Abstract

Read online

**Background:** Although increasing in prevalence, nonalcoholic steatohepatitis (NASH) is often undiagnosed in clinical practice. **Objective:** This study identified patients in the Veterans Affairs (VA) health system who likely had undiagnosed NASH using a machine learning algorithm. **Methods:** From a VA data set of 25 million adult enrollees, the study population was divided into NASH-positive, non-NASH, and at-risk cohorts. We performed a claims data analysis using a machine learning algorithm. To build our model, the study population was randomly divided into an 80% training subset and a 20% testing subset and tested and trained using a cross-validation technique. In addition to the baseline model, a gradient-boosted classification tree, naïve Bayes, and random forest model were created and compared using receiver operator characteristics, area under the curve, and accuracy. The best performing model was retrained on the full 80% training subset and applied to the 20% testing subset to calculate the performance metrics. **Results:** In total, 4 223 443 patients met the study inclusion criteria, of whom 4903 were positive for NASH and 35 528 were non-NASH patients. The remainder was in the at-risk patient cohort, of which 514 997 patients (12%) were identified as likely to have NASH. Age, obesity, and abnormal liver function tests were the top determinants in assigning NASH probability. **Conclusions:** Utilization of machine learning to predict NASH allows for wider recognition, timely intervention, and targeted treatments to improve or mitigate disease progression and could be used as an initial screening tool.