A unified Foot and Mouth Disease dataset for Uganda: evaluating machine learning predictive performance degradation under varying distributions

Geofrey Kapalaga; Florence N. Kivunike; Susan Kerfua; Daudi Jjingo; Daudi Jjingo; Savino Biryomumaisho; Justus Rutaisire; Paul Ssajjakambwe; Swidiq Mugerwa; Yusuf Kiwala

doi:10.3389/frai.2024.1446368

Frontiers in Artificial Intelligence (Jul 2024)

A unified Foot and Mouth Disease dataset for Uganda: evaluating machine learning predictive performance degradation under varying distributions

Geofrey Kapalaga,
Florence N. Kivunike,
Susan Kerfua,
Daudi Jjingo,
Daudi Jjingo,
Savino Biryomumaisho,
Justus Rutaisire,
Paul Ssajjakambwe,
Swidiq Mugerwa,
Yusuf Kiwala

Affiliations

Geofrey Kapalaga: Department of Information Technology, College of Computing and Information Sciences, Makerere University, Kampala, Uganda
Florence N. Kivunike: Department of Information Technology, College of Computing and Information Sciences, Makerere University, Kampala, Uganda
Susan Kerfua: National Livestock Resources Research Institute, Kampala, Uganda
Daudi Jjingo: African Center of Excellence in Bioinformatics (ACE-B), Makerere University, Kampala, Uganda
Daudi Jjingo: Department of Computer Science, College of Computing and Information Sciences, Makerere University, Kampala, Uganda
Savino Biryomumaisho: College of Veterinary Medicine, Animal Resources and Bio-Security, Makerere University, Kampala, Uganda
Justus Rutaisire: National Livestock Resources Research Institute, Kampala, Uganda
Paul Ssajjakambwe: National Livestock Resources Research Institute, Kampala, Uganda
Swidiq Mugerwa: National Livestock Resources Research Institute, Kampala, Uganda
Yusuf Kiwala: College of Business and Management Sciences, Makerere University, Kampala, Uganda

DOI: https://doi.org/10.3389/frai.2024.1446368
Journal volume & issue: Vol. 7

Abstract

Read online

In Uganda, the absence of a unified dataset for constructing machine learning models to predict Foot and Mouth Disease outbreaks hinders preparedness. Although machine learning models exhibit excellent predictive performance for Foot and Mouth Disease outbreaks under stationary conditions, they are susceptible to performance degradation in non-stationary environments. Rainfall and temperature are key factors influencing these outbreaks, and their variability due to climate change can significantly impact predictive performance. This study created a unified Foot and Mouth Disease dataset by integrating disparate sources and pre-processing data using mean imputation, duplicate removal, visualization, and merging techniques. To evaluate performance degradation, seven machine learning models were trained and assessed using metrics including accuracy, area under the receiver operating characteristic curve, recall, precision and F1-score. The dataset showed a significant class imbalance with more non-outbreaks than outbreaks, requiring data augmentation methods. Variability in rainfall and temperature impacted predictive performance, causing notable degradation. Random Forest with borderline SMOTE was the top-performing model in a stationary environment, achieving 92% accuracy, 0.97 area under the receiver operating characteristic curve, 0.94 recall, 0.90 precision, and 0.92 F1-score. However, under varying distributions, all models exhibited significant performance degradation, with random forest accuracy dropping to 46%, area under the receiver operating characteristic curve to 0.58, recall to 0.03, precision to 0.24, and F1-score to 0.06. This study underscores the creation of a unified Foot and Mouth Disease dataset for Uganda and reveals significant performance degradation in seven machine learning models under varying distributions. These findings highlight the need for new methods to address the impact of distribution variability on predictive performance.

Published in Frontiers in Artificial Intelligence

ISSN: 2624-8212 (Online)
Publisher: Frontiers Media S.A.
Country of publisher: Switzerland
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://www.frontiersin.org/journals/artificial-intelligence#

About the journal

Abstract

Keywords