Real-world challenges for multi-agent reinforcement learning in grid-interactive buildings

Kingsley Nweye; Bo Liu; Peter Stone; Zoltan Nagy

Energy and AI (Nov 2022)

Real-world challenges for multi-agent reinforcement learning in grid-interactive buildings

Kingsley Nweye,
Bo Liu,
Peter Stone,
Zoltan Nagy

Affiliations

Kingsley Nweye: Intelligent Environments Laboratory, Department of Civil, Architectural and Environmental Engineering, The University of Texas at Austin, 301 E. Dean Keeton St., ECJ 4.200, Austin, 78712-1700, TX, USA
Bo Liu: Department of Computer Science, The University of Texas at Austin, 2317 Speedway, GDC 2.302, Austin, 78712-1700, TX, USA
Peter Stone: Department of Computer Science, The University of Texas at Austin, 2317 Speedway, GDC 2.302, Austin, 78712-1700, TX, USA
Zoltan Nagy: Intelligent Environments Laboratory, Department of Civil, Architectural and Environmental Engineering, The University of Texas at Austin, 301 E. Dean Keeton St., ECJ 4.200, Austin, 78712-1700, TX, USA; Corresponding author.

Journal volume & issue: Vol. 10
p. 100202

Abstract

Read online

Building upon prior research that highlighted the need for standardizing environments for building control research, and inspired by recently introduced challenges for real life reinforcement learning (RL) control, here we propose a non-exhaustive set of nine real world challenges for RL control in grid-interactive buildings (GIBs). We argue that research in this area should be expressed in this framework in addition to providing a standardized environment for repeatability. Advanced controllers such as model predictive control (MPC) and RL control have both advantages and disadvantages that prevent them from being implemented in real world problems. Comparisons between the two are rare, and often biased. By focusing on the challenges, we can investigate the performance of the controllers under a variety of situations and generate a fair comparison. As a demonstration, we implement the offline learning challenge in CityLearn, an OpenAI Gym environment for the easy implementation of RL agents in a demand response setting to reshape the aggregated curve of electricity demand by controlling the energy storage of a diverse set of buildings in a district. We use CityLearn to study the impact of different levels of domain knowledge and complexity of RL algorithms and show that the sequence of operations (SOOs) utilized in a rule based controller (RBC) that provides fixed logs to RL agents during offline training affect the performance of the agents when evaluated on a set of four energy flexibility metrics. Longer offline training from an optimized RBC leads to improved performance in the long run. RL agents that train on the logs from a simplified RBC risk poorer performance as the offline training period increases. We also observe no impact on performance from information sharing amongst agents. We call for a more interdisciplinary effort of the research community to address the real world challenges, and unlock the potential of GIB controllers.

Published in Energy and AI

ISSN: 2666-5468 (Online)
Publisher: Elsevier
Country of publisher: United Kingdom
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering; Science: Mathematics: Instruments and machines: Electronic computers. Computer science: Computer software
Website: https://www.journals.elsevier.com/energy-and-ai

About the journal

Abstract

Keywords