Journal of Management Science and Engineering (Jun 2022)

Multi-objective vehicle rebalancing for ridehailing system using a reinforcement learning approach

  • Yuntian Deng,
  • Hao Chen,
  • Shiping Shao,
  • Jiacheng Tang,
  • Jianzong Pi,
  • Abhishek Gupta

Journal volume & issue
Vol. 7, no. 2
pp. 346 – 364

Abstract

Read online

The problem of designing a rebalancing algorithm for a large-scale ridehailing system with asymmetric (unbalanced) demand is considered here. We pose the rebalancing problem within a semi Markov decision problem (SMDP) framework with closed queues of vehicles serving stationary, but asymmetric demand, over a large city with multiple stations (representing neighborhoods). We assume that the passengers queue up at every station until they are matched with a vehicle. The goal of the SMDP is to minimize a convex combination of the waiting time of the passengers and the total empty vehicle miles traveled. The resulting SMDP appears to be difficult to solve yielding closed-form expression for the optimal rebalancing strategy. Consequently, we use a deep reinforcement learning algorithm to determine the approximately optimal solution to the SMDP. We show through extensive Monte Carlo simulations that the trained policy outperforms other well-known state-dependent rebalancing strategies.

Keywords