Model-Free All-Source-All-Destination Learning as a Model for Biological Reactive Control

Martinius Knudsen; Sverre Hendseth; Gunnar Tufte; Axel Sandvig

doi:10.4173/mic.2021.4.5

Modeling, Identification and Control (Oct 2021)

Model-Free All-Source-All-Destination Learning as a Model for Biological Reactive Control

Martinius Knudsen,
Sverre Hendseth,
Gunnar Tufte,
Axel Sandvig

Affiliations

Martinius Knudsen
Sverre Hendseth
Gunnar Tufte
Axel Sandvig

DOI: https://doi.org/10.4173/mic.2021.4.5
Journal volume & issue: Vol. 42, no. 4
pp. 197 – 204

Abstract

Read online

We present here a model-free method for learning actions that lead to an all-source-all-destination shortest path solution. We motivate our approach in the context of biological learning for reactive control. Our method involves an agent exploring an unknown world with the objective of learning how to get from any starting state to any goal state in shortest time without having to run a path planning algorithm for each new goal selection. Using concepts of Lyapunov functions and Bellman's principle of optimality, our agent learns universal state-goal distances and best actions that solve this problem.

Published in Modeling, Identification and Control

ISSN: 0332-7353 (Print); 1890-1328 (Online)
Publisher: Norwegian Society of Automatic Control
Country of publisher: Norway
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: http://www.mic-journal.no/

About the journal

Abstract

Keywords