Data in Brief (Aug 2024)
Daily electric vehicle charging dataset for training reinforcement learning algorithms
Abstract
Reinforcement learning algorithms are increasingly utilized across diverse domains within power systems. One notable challenge in training and deploying these algorithms is the acquisition of large, realistic datasets. It is imperative that these algorithms are trained on extensive, realistic datasets over numerous iterations to ensure optimal performance in real-world scenarios. In pursuit of this goal, we curated a comprehensive dataset capturing electric vehicle (EV) charging details over a span of 29,600 days within a designated parking facility. This dataset encompasses necessary information such as connection times, charging durations, and energy consumption of individual EVs. The methodology involved employing conditional tabular generative adversarial networks (CTGAN) to craft a pool of synthetic dataset from a smaller initial dataset collected from an EV charging facility located on the Caltech campus. Subsequently, multiple post-processing techniques were implemented to extract data from this pool, ensuring compliance with the charging station's capacity constraint while maintaining a realistic daily EV demand profile derived from historical data. Using kernel density estimation (KDE), the distributional characteristics of the historical data, especially concerning the timing of EV connections, were faithfully replicated. The developed dataset is specifically useful in training offline reinforcement learning algorithms.