Machine Learning with Applications (Sep 2024)
Prediction of bike-sharing station demand using explainable artificial intelligence
Abstract
Bike-sharing systems have grown in popularity in metropolitan areas, providing a handy and environmentally friendly transportation choice for commuters and visitors alike. As demand for bike-sharing programs grows, efficient capacity planning becomes critical to ensuring good user experience and system sustainability in terms of demand. The random forest model was used in this study to predict bike-sharing station demand and is considered a strong ensemble learning approach that can successfully capture complicated nonlinear correlations and interactions between input variables. This study employed data from the Smart Location Database (SLD) to test the model accuracy in estimating station demand and used a form of explainable artificial intelligence (XAI) function to further understand machine learning (ML) prediction outcomes owing to the blackbox tendencies of ML models. Vehicle Miles of Travel (VMT) and Greenhouse Gas (GHG) emissions were the most important features in predicting docking station demand individually but not holistically based on the datasets. The percentage of zero-car households, gross residential density, road network density, aggregate frequency of transit service, and gross activity density were found to have a moderate influence on the prediction model. Further, there may be a better prediction model generating sensible results for every type of explanatory variable, but their contributions are minimum to the prediction outcome. By measuring each feature's contribution to demand prediction in feature engineering, bike-sharing operators can acquire a better understanding of the bike-sharing station capacity and forecast future demands during planning. At the same time, ML models will need further assessment before a holistic conclusion.