Machine learning approaches classify clinical malaria outcomes based on haematological parameters

Collins M. Morang’a; Lucas Amenga–Etego; Saikou Y. Bah; Vincent Appiah; Dominic S. Y. Amuzu; Nicholas Amoako; James Abugri; Abraham R. Oduro; Aubrey J. Cunnington; Gordon A. Awandare; Thomas D. Otto

doi:10.1186/s12916-020-01823-3

BMC Medicine (Nov 2020)

Machine learning approaches classify clinical malaria outcomes based on haematological parameters

Collins M. Morang’a,
Lucas Amenga–Etego,
Saikou Y. Bah,
Vincent Appiah,
Dominic S. Y. Amuzu,
Nicholas Amoako,
James Abugri,
Abraham R. Oduro,
Aubrey J. Cunnington,
Gordon A. Awandare,
Thomas D. Otto

Affiliations

Collins M. Morang’a: West African Centre for Cell Biology of Infectious Pathogens (WACCBIP), Department of Biochemistry, Cell and Molecular Biology, University of Ghana
Lucas Amenga–Etego: West African Centre for Cell Biology of Infectious Pathogens (WACCBIP), Department of Biochemistry, Cell and Molecular Biology, University of Ghana
Saikou Y. Bah: West African Centre for Cell Biology of Infectious Pathogens (WACCBIP), Department of Biochemistry, Cell and Molecular Biology, University of Ghana
Vincent Appiah: West African Centre for Cell Biology of Infectious Pathogens (WACCBIP), Department of Biochemistry, Cell and Molecular Biology, University of Ghana
Dominic S. Y. Amuzu: West African Centre for Cell Biology of Infectious Pathogens (WACCBIP), Department of Biochemistry, Cell and Molecular Biology, University of Ghana
Nicholas Amoako: West African Centre for Cell Biology of Infectious Pathogens (WACCBIP), Department of Biochemistry, Cell and Molecular Biology, University of Ghana
James Abugri: Department of Applied Chemistry and Biochemistry, C. K Tedam University of Technology and Applied Sciences
Abraham R. Oduro: Ministry of Health, Navrongo Health Research Centre (NHRC)
Aubrey J. Cunnington: Section of Pediatric Infectious Disease, Department of Infectious Disease, Imperial College London
Gordon A. Awandare: West African Centre for Cell Biology of Infectious Pathogens (WACCBIP), Department of Biochemistry, Cell and Molecular Biology, University of Ghana
Thomas D. Otto: Institute of Infection, Immunity & Inflammation, MVLS, University of Glasgow

DOI: https://doi.org/10.1186/s12916-020-01823-3
Journal volume & issue: Vol. 18, no. 1
pp. 1 – 16

Abstract

Read online

Abstract Background Malaria is still a major global health burden, with more than 3.2 billion people in 91 countries remaining at risk of the disease. Accurately distinguishing malaria from other diseases, especially uncomplicated malaria (UM) from non-malarial infections (nMI), remains a challenge. Furthermore, the success of rapid diagnostic tests (RDTs) is threatened by Pfhrp2/3 deletions and decreased sensitivity at low parasitaemia. Analysis of haematological indices can be used to support the identification of possible malaria cases for further diagnosis, especially in travellers returning from endemic areas. As a new application for precision medicine, we aimed to evaluate machine learning (ML) approaches that can accurately classify nMI, UM, and severe malaria (SM) using haematological parameters. Methods We obtained haematological data from 2,207 participants collected in Ghana: nMI (n = 978), SM (n = 526), and UM (n = 703). Six different ML approaches were tested, to select the best approach. An artificial neural network (ANN) with three hidden layers was used for multi-classification of UM, SM, and uMI. Binary classifiers were developed to further identify the parameters that can distinguish UM or SM from nMI. Local interpretable model-agnostic explanations (LIME) were used to explain the binary classifiers. Results The multi-classification model had greater than 85% training and testing accuracy to distinguish clinical malaria from nMI. To distinguish UM from nMI, our approach identified platelet counts, red blood cell (RBC) counts, lymphocyte counts, and percentages as the top classifiers of UM with 0.801 test accuracy (AUC = 0.866 and F1 score = 0.747). To distinguish SM from nMI, the classifier had a test accuracy of 0.96 (AUC = 0.983 and F1 score = 0.944) with mean platelet volume and mean cell volume being the unique classifiers of SM. Random forest was used to confirm the classifications, and it showed that platelet and RBC counts were the major classifiers of UM, regardless of possible confounders such as patient age and sampling location. Conclusion The study provides proof of concept methods that classify UM and SM from nMI, showing that the ML approach is a feasible tool for clinical decision support. In the future, ML approaches could be incorporated into clinical decision-support algorithms for the diagnosis of acute febrile illness and monitoring response to acute SM treatment particularly in endemic settings.

Published in BMC Medicine

ISSN: 1741-7015 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Medicine
Website: http://bmcmedicine.biomedcentral.com

About the journal

Abstract

Keywords