Gastro Hep Advances (Jan 2023)

Development and Validation of a Machine Learning–Based Prediction Model for Detection of Biliary Atresia

  • Ho Jung Choi,
  • Yeong Eun Kim,
  • Jung-Man Namgoong,
  • Inki Kim,
  • Jun Sung Park,
  • Woo Im Baek,
  • Byong Sop Lee,
  • Hee Mang Yoon,
  • Young Ah Cho,
  • Jin Seong Lee,
  • Jung Ok Shim,
  • Seak Hee Oh,
  • Jin Soo Moon,
  • Jae Sung Ko,
  • Dae Yeon Kim,
  • Kyung Mo Kim

Journal volume & issue
Vol. 2, no. 6
pp. 778 – 787

Abstract

Read online

Background and Aims: Biliary atresia is a rare and devastating bile duct disease that occurs during the neonatal period. Timely identification and prompt surgical intervention is critical for improving the outcome. The aim of the study was to develop a new machine learning–based prediction model for the detection of biliary atresia. Methods: Neonates aged <100 days with cholestasis at least once were retrospectively screened in 2 tertiary referral hospitals between 2015 and 2020. Simple demographic data, routine laboratory indices, and imaging findings of ultrasonography and hepatobiliary scintigraphy were used as features in the multivariate analysis. The extreme gradient boosting (XGBoost) framework was used to develop prediction models according to the diagnostic steps. Results: Among 1605 enrolled neonates with all-cause cholestasis, 145 (9%) were included as having biliary atresia. Direct bilirubin, gamma-glutamyl transpeptidase, abdominal sonography, and hepatobiliary scan were the most impactful features in prediction models. The Step II XGBoost model, consisting of nonimaging inputs, showed excellent discriminatory performance (area under the curve = 0.97). The Step III and IV XGBoost models showed near-perfect performances (area under the curve = 0.998 and 0.999, respectively). In external validation (n = 912 with 118 [12.9%] biliary atresia), XGBoost-based prediction models consistently showed acceptable performances. Utilizing shapley additive explanation values also provided visualized insight and explanation of the contribution of features in detecting biliary atresia. The models were integrated into a web-based diagnostic tool for case-level application. Conclusion: We introduced a new machine learning–based prediction model for detecting biliary atresia in the largest cohorts of neonatal cholestasis.

Keywords