Journal of Systemics, Cybernetics and Informatics (Aug 2005)

Reinforcement Learning for a New Piano Mover

  • Yuko Ishiwaka,
  • Tomohiro Yoshida,
  • Yukinori Kakazu

Journal volume & issue
Vol. 3, no. 4
pp. 85 – 90

Abstract

Read online

We attempt to achieve corporative behavior of autonomous decentralized agents constructed via Q-Learning, which is a type of reinforcement learning. As such, in the present paper, we examine the piano mover's problem. We propose a multi-agent architecture that has a training agent, learning agents and intermediate agent. Learning agents are heterogeneous and can communicate with each other. The movement of an object with three kinds of agent depends on the composition of the actions of the learning agents. By learning its own shape through the learning agents, avoidance of obstacles by the object is expected. We simulate the proposed method in a two-dimensional continuous world. Results obtained in the present investigation reveal the effectiveness of the proposed method.

Keywords