PLoS ONE (Jan 2022)

COVID-19 Risk Stratification and Mortality Prediction in Hospitalized Indian Patients: Harnessing clinical data for public health benefits.

  • Shanmukh Alle,
  • Akshay Kanakan,
  • Samreen Siddiqui,
  • Akshit Garg,
  • Akshaya Karthikeyan,
  • Priyanka Mehta,
  • Neha Mishra,
  • Partha Chattopadhyay,
  • Priti Devi,
  • Swati Waghdhare,
  • Akansha Tyagi,
  • Bansidhar Tarai,
  • Pranjal Pratim Hazarik,
  • Poonam Das,
  • Sandeep Budhiraja,
  • Vivek Nangia,
  • Arun Dewan,
  • Ramanathan Sethuraman,
  • C Subramanian,
  • Mashrin Srivastava,
  • Avinash Chakravarthi,
  • Johnny Jacob,
  • Madhuri Namagiri,
  • Varma Konala,
  • Debasish Dash,
  • Tavpritesh Sethi,
  • Sujeet Jha,
  • Anurag Agrawal,
  • Rajesh Pandey,
  • P K Vinod,
  • U Deva Priyakumar

DOI
https://doi.org/10.1371/journal.pone.0264785
Journal volume & issue
Vol. 17, no. 3
p. e0264785

Abstract

Read online

The variability of clinical course and prognosis of COVID-19 highlights the necessity of patient sub-group risk stratification based on clinical data. In this study, clinical data from a cohort of Indian COVID-19 hospitalized patients is used to develop risk stratification and mortality prediction models. We analyzed a set of 70 clinical parameters including physiological and hematological for developing machine learning models to identify biomarkers. We also compared the Indian and Wuhan cohort, and analyzed the role of steroids. A bootstrap averaged ensemble of Bayesian networks was also learned to construct an explainable model for discovering actionable influences on mortality and days to outcome. We discovered blood parameters, diabetes, co-morbidity and SpO2 levels as important risk stratification features, whereas mortality prediction is dependent only on blood parameters. XGboost and logistic regression model yielded the best performance on risk stratification and mortality prediction, respectively (AUC score 0.83, AUC score 0.92). Blood coagulation parameters (ferritin, D-Dimer and INR), immune and inflammation parameters IL6, LDH and Neutrophil (%) are common features for both risk and mortality prediction. Compared with Wuhan patients, Indian patients with extreme blood parameters indicated higher survival rate. Analyses of medications suggest that a higher proportion of survivors and mild patients who were administered steroids had extreme neutrophil and lymphocyte percentages. The ensemble averaged Bayesian network structure revealed serum ferritin to be the most important predictor for mortality and Vitamin D to influence severity independent of days to outcome. The findings are important for effective triage during strains on healthcare infrastructure.