A MARL Approach for Optimizing Positions of VANET Aerial Base-Stations on a Sparse Highway

Bote Jiang; Sidney N. Givigi; Jean-Alexis Delamer

doi:10.1109/ACCESS.2021.3108891

IEEE Access (Jan 2021)

A MARL Approach for Optimizing Positions of VANET Aerial Base-Stations on a Sparse Highway

Bote Jiang,
Sidney N. Givigi,
Jean-Alexis Delamer

Affiliations

Bote Jiang: ORCiD; School of Computing, Queen’s University, Kingston, Canada
Sidney N. Givigi: ORCiD; School of Computing, Queen’s University, Kingston, Canada
Jean-Alexis Delamer: ORCiD; Department of Computer Science, St. Francis Xavier University, Antigonish, Canada

DOI: https://doi.org/10.1109/ACCESS.2021.3108891
Journal volume & issue: Vol. 9
pp. 133989 – 134004

Abstract

Read online

A Vehicular Ad-Hoc Network (VANET) helps vehicles send and receive environmental and traffic information, making it a crucial component towards fully autonomous roads. For VANETs to serve their purpose, there has to be sufficient coverage, even in areas where there is less demand. Moreover, a lot of the safety information is time-sensitive; excessive delay in data transfer can increase the risk of fatal accidents. Unmanned Aerial Vehicles (UAVs) can be used as mobile base-stations to fill in gaps of coverage. The placement of these UAVs is crucial towards how well the system performs. We are particularly interested in the placement of mobile base-stations for a rural highway with sparse traffic, as it represents the worst-case scenario for vehicular communication. Instead of heuristic or linear programming methods for optimal placement, we use multi-agent reinforcement learning (MARL). The main benefit of MARL is that it allows the agents to learn model-free through experience. We propose a variation of the traditional Deep Independent Q-Learning. The modifications include an observation function augmented with information directly shared between neighbouring agents as well a shared policy scheme. We also implement a custom sparse highway simulator that is used for training and testing our algorithm. Our testing shows that the proposed MARL algorithm is able to learn the placement policies that produce the maximum rewards for different scenarios while adapting to the dynamic road densities along the service area. Our experiments show that our model is scalable, allowing the number of agents to increase without any modifications to the code. Finally, we show that our model can be generalized as the algorithm can be directly used on an industry standard simulator with similar performance. Future experiments can be performed to improve the realism and complexity of the highway models as well as to test the method on real-world data.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords