Learning to reason over scene graphs: a case study of finetuning GPT-2 into a robot language model for grounded task planning

Georgia Chalvatzaki; Ali Younes; Daljeet Nandha; An Thai Le; Leonardo F. R. Ribeiro; Iryna Gurevych

doi:10.3389/frobt.2023.1221739

Frontiers in Robotics and AI (Aug 2023)

Affiliations

Georgia Chalvatzaki: Computer Science Department, Technische Universität Darmstadt, Darmstadt, Germany
Georgia Chalvatzaki: Hessian.AI, Darmstadt, Germany
Georgia Chalvatzaki: Center for Mind, Brain and Behavior, University Marburg and JLU Giessen, Marburg, Germany
Ali Younes: Computer Science Department, Technische Universität Darmstadt, Darmstadt, Germany
Daljeet Nandha: Computer Science Department, Technische Universität Darmstadt, Darmstadt, Germany
An Thai Le: Computer Science Department, Technische Universität Darmstadt, Darmstadt, Germany
Leonardo F. R. Ribeiro: Amazon Alexa, Seattle, WA, United States
Iryna Gurevych: Computer Science Department, Technische Universität Darmstadt, Darmstadt, Germany
Iryna Gurevych: Hessian.AI, Darmstadt, Germany

DOI: https://doi.org/10.3389/frobt.2023.1221739
Journal volume & issue: Vol. 10

Abstract

Long-horizon task planning is essential for the development of intelligent assistive and service robots. In this work, we investigate the applicability of a smaller class of large language models (LLMs), specifically GPT-2, in robotic task planning by learning to decompose tasks into subgoal specifications for a planner to execute sequentially. Our method grounds the input of the LLM on the domain that is represented as a scene graph, enabling it to translate human requests into executable robot plans, thereby learning to reason over long-horizon tasks, as encountered in the ALFRED benchmark. We compare our approach with classical planning and baseline methods to examine the applicability and generalizability of LLM-based planners. Our findings suggest that the knowledge stored in an LLM can be effectively grounded to perform long-horizon task planning, demonstrating the promising potential for the future application of neuro-symbolic planning methods in robotics.

Published in Frontiers in Robotics and AI

ISSN: 2296-9144 (Online)
Publisher: Frontiers Media S.A.
Country of publisher: Switzerland
LCC subjects: Technology: Mechanical engineering and machinery; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://www.frontiersin.org/journals/robotics-and-ai
