An Interpretable Machine Learning Approach for Hepatitis B Diagnosis

George Obaido; Blessing Ogbuokiri; Theo G. Swart; Nimibofa Ayawei; Sydney Mambwe Kasongo; Kehinde Aruleba; Ibomoiye Domor Mienye; Idowu Aruleba; Williams Chukwu; Fadekemi Osaye; Oluwaseun F. Egbelowo; Simelane Simphiwe; Ebenezer Esenogho

doi:10.3390/app122111127

Applied Sciences (Nov 2022)

An Interpretable Machine Learning Approach for Hepatitis B Diagnosis

George Obaido,
Blessing Ogbuokiri,
Theo G. Swart,
Nimibofa Ayawei,
Sydney Mambwe Kasongo,
Kehinde Aruleba,
Ibomoiye Domor Mienye,
Idowu Aruleba,
Williams Chukwu,
Fadekemi Osaye,
Oluwaseun F. Egbelowo,
Simelane Simphiwe,
Ebenezer Esenogho

Affiliations

George Obaido: Center for Human-Compatible Artificial Intelligence (CHAI), Berkeley Institute for Data Science (BIDS), University of California, Berkeley, CA 94720, USA
Blessing Ogbuokiri: Department of Mathematics and Statistics, York University, Toronto, ON M3J 1P3, Canada
Theo G. Swart: Center for Telecommunications, Department of Electrical and Electronic Engineering Science, University of Johannesburg, Johannesburg 2006, South Africa
Nimibofa Ayawei: Department of Chemistry, Bayelsa Medical University, Yenagoa PMB 178, Nigeria
Sydney Mambwe Kasongo: Department of Industrial Engineering, Faculty of Engineering, Stellenbosch University, Stellenbosch 7600, South Africa
Kehinde Aruleba: School of Computing and Mathematical Sciences, University of Leicester, Leicester LE1 7RH, UK
Ibomoiye Domor Mienye: Department of Electrical and Electronic Engineering Science, University of Johannesburg, Johannesburg 2006, South Africa
Idowu Aruleba: Department of Electrical and Electronic Engineering Science, University of Johannesburg, Johannesburg 2006, South Africa
Williams Chukwu: Department of Mathematics, Wake Forest University, Winston-Salem, NC 27109, USA
Fadekemi Osaye: Department of Mathematics and Computer Science, Alabama State University, Montgomery, AL 36104, USA
Oluwaseun F. Egbelowo: Department of Integrative Biology, The University of Texas at Austin, Austin, TX 78712, USA
Simelane Simphiwe: Department of Mathematics and Applied Mathematics, University of Johannesburg, Doornfontein 2028, South Africa
Ebenezer Esenogho: Center for Telecommunications, Department of Electrical and Electronic Engineering Science, University of Johannesburg, Johannesburg 2006, South Africa

DOI: https://doi.org/10.3390/app122111127
Journal volume & issue: Vol. 12, no. 21
p. 11127

Abstract

Read online

Hepatitis B is a potentially deadly liver infection caused by the hepatitis B virus. It is a serious public health problem globally. Substantial efforts have been made to apply machine learning in detecting the virus. However, the application of model interpretability is limited in the existing literature. Model interpretability makes it easier for humans to understand and trust the machine-learning model. Therefore, in this study, we used SHapley Additive exPlanations (SHAP), a game-based theoretical approach to explain and visualize the predictions of machine learning models applied for hepatitis B diagnosis. The algorithms used in building the models include decision tree, logistic regression, support vector machines, random forest, adaptive boosting (AdaBoost), and extreme gradient boosting (XGBoost), and they achieved balanced accuracies of 75%, 82%, 75%, 86%, 92%, and 90%, respectively. Meanwhile, the SHAP values showed that bilirubin is the most significant feature contributing to a higher mortality rate. Consequently, older patients are more likely to die with elevated bilirubin levels. The outcome of this study can aid health practitioners and health policymakers in explaining the result of machine learning models for health-related problems.

Published in Applied Sciences

ISSN: 2076-3417 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Engineering (General). Civil engineering (General); Science: Biology (General); Science: Physics; Science: Chemistry
Website: http://www.mdpi.com/journal/applsci

About the journal

Abstract

Keywords