Prediction of bike-sharing station demand using explainable artificial intelligence

Frank Ngeni; Boniphace Kutela; Tumlumbe Juliana Chengula; Cuthbert Ruseruka; Hannah Musau; Norris Novat; Debbie Aisiana Indah; Sarah Kasomi

Machine Learning with Applications (Sep 2024)

Prediction of bike-sharing station demand using explainable artificial intelligence

Frank Ngeni,
Boniphace Kutela,
Tumlumbe Juliana Chengula,
Cuthbert Ruseruka,
Hannah Musau,
Norris Novat,
Debbie Aisiana Indah,
Sarah Kasomi

Affiliations

Frank Ngeni: South Carolina State University, Orangeburg, SC 29117, USA; Corresponding author.
Boniphace Kutela: Texas A&M Transportation Institute, Houston, TX 77024, USA
Tumlumbe Juliana Chengula: South Carolina State University, Orangeburg, SC 29117, USA
Cuthbert Ruseruka: South Carolina State University, Orangeburg, SC 29117, USA
Hannah Musau: South Carolina State University, Orangeburg, SC 29117, USA
Norris Novat: Leidos Inc. STOL-Turner Fairbank Highway Research Center, 6300 Georgetown Pike, McLean, VA 22101, USA
Debbie Aisiana Indah: South Carolina State University, Orangeburg, SC 29117, USA
Sarah Kasomi: HDR Inc., Jacksonville, FL 32202, USA

Journal volume & issue: Vol. 17
p. 100582

Abstract

Read online

Bike-sharing systems have grown in popularity in metropolitan areas, providing a handy and environmentally friendly transportation choice for commuters and visitors alike. As demand for bike-sharing programs grows, efficient capacity planning becomes critical to ensuring good user experience and system sustainability in terms of demand. The random forest model was used in this study to predict bike-sharing station demand and is considered a strong ensemble learning approach that can successfully capture complicated nonlinear correlations and interactions between input variables. This study employed data from the Smart Location Database (SLD) to test the model accuracy in estimating station demand and used a form of explainable artificial intelligence (XAI) function to further understand machine learning (ML) prediction outcomes owing to the blackbox tendencies of ML models. Vehicle Miles of Travel (VMT) and Greenhouse Gas (GHG) emissions were the most important features in predicting docking station demand individually but not holistically based on the datasets. The percentage of zero-car households, gross residential density, road network density, aggregate frequency of transit service, and gross activity density were found to have a moderate influence on the prediction model. Further, there may be a better prediction model generating sensible results for every type of explanatory variable, but their contributions are minimum to the prediction outcome. By measuring each feature's contribution to demand prediction in feature engineering, bike-sharing operators can acquire a better understanding of the bike-sharing station capacity and forecast future demands during planning. At the same time, ML models will need further assessment before a holistic conclusion.

Published in Machine Learning with Applications

ISSN: 2666-8270 (Online)
Publisher: Elsevier
Country of publisher: United Kingdom
LCC subjects: Science: Science (General): Cybernetics; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://www.journals.elsevier.com/machine-learning-with-applications

About the journal

Abstract

Keywords