Compositional RL Agents That Follow Language Commands in Temporal Logic

Yen-Ling Kuo; Boris Katz; Andrei Barbu

doi:10.3389/frobt.2021.689550

Frontiers in Robotics and AI (Jul 2021)

Affiliations

Yen-Ling Kuo: CSAIL, MIT, Cambridge, MA, United States
Yen-Ling Kuo: CBMM, MIT, Cambridge, MA, United States
Boris Katz: CSAIL, MIT, Cambridge, MA, United States
Boris Katz: CBMM, MIT, Cambridge, MA, United States
Andrei Barbu: CSAIL, MIT, Cambridge, MA, United States
Andrei Barbu: CBMM, MIT, Cambridge, MA, United States

DOI: https://doi.org/10.3389/frobt.2021.689550
Journal volume & issue: Vol. 8

Abstract

We demonstrate how a reinforcement learning agent can use compositional recurrent neural networks to learn to carry out commands specified in linear temporal logic (LTL). Our approach takes as input an LTL formula, structures a deep network according to the parse of the formula, and determines satisfying actions. This compositional structure of the network enables zero-shot generalization to significantly more complex unseen formulas. We demonstrate this ability in multiple problem domains with both discrete and continuous state-action spaces. In a symbolic domain, the agent finds a sequence of letters that satisfy a specification. In a Minecraft-like environment, the agent finds a sequence of actions that conform to a formula. In the Fetch environment, the robot finds a sequence of arm configurations that move blocks on a table to fulfill the commands. While most prior work can learn to execute one formula reliably, we develop a novel form of multi-task learning for RL agents that allows them to learn from a diverse set of tasks and generalize to a new set of diverse tasks without any additional training. The compositional structures presented here are not specific to LTL, thus opening the path to RL agents that perform zero-shot generalization in other compositional domains.
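
To make the symbolic-domain task concrete, the following is a minimal sketch of checking whether a sequence of letters satisfies an LTL formula. It is not the authors' neural model: it is a plain symbolic checker whose recursion follows the parse of the formula, which is the same structure the abstract says the network is built around. The nested-tuple formula encoding, the operator names, the function name `holds`, and the use of finite-trace semantics are all assumptions made for illustration.

```python
# Illustrative sketch only: a symbolic checker, not the paper's neural model.
# Formulas are nested tuples (a hypothetical encoding), for example
# ("until", ("atom", "a"), ("atom", "b")).

def holds(formula, trace, i=0):
    """Return True if `formula` holds on `trace` from position i.

    Assumes finite-trace semantics: the trace ends at len(trace).
    """
    op = formula[0]
    if op == "atom":
        # An atom holds when the letter at position i matches.
        return i < len(trace) and trace[i] == formula[1]
    if op == "not":
        return not holds(formula[1], trace, i)
    if op == "and":
        return holds(formula[1], trace, i) and holds(formula[2], trace, i)
    if op == "or":
        return holds(formula[1], trace, i) or holds(formula[2], trace, i)
    if op == "next":
        return i + 1 < len(trace) and holds(formula[1], trace, i + 1)
    if op == "eventually":
        return any(holds(formula[1], trace, j) for j in range(i, len(trace)))
    if op == "always":
        return all(holds(formula[1], trace, j) for j in range(i, len(trace)))
    if op == "until":
        # formula[2] must hold at some j >= i, and formula[1] must hold
        # at every position from i up to (not including) j.
        return any(
            holds(formula[2], trace, j)
            and all(holds(formula[1], trace, k) for k in range(i, j))
            for j in range(i, len(trace))
        )
    raise ValueError(f"unknown operator: {op}")


# "a holds until b, and b eventually occurs"
spec = ("and",
        ("eventually", ("atom", "b")),
        ("until", ("atom", "a"), ("atom", "b")))

print(holds(spec, "aab"))
print(holds(spec, "acb"))
```

Each operator is handled by one branch that calls back into `holds` on its sub-formulas, so a longer formula is evaluated by reusing the same pieces. In the paper's setting, learned recurrent modules would take the place of these hand-written branches.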

Published in Frontiers in Robotics and AI

ISSN: 2296-9144 (Online)
Publisher: Frontiers Media S.A.
Country of publisher: Switzerland
LCC subjects: Technology: Mechanical engineering and machinery; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://www.frontiersin.org/journals/robotics-and-ai
