Paladyn (Mar 2010)

Exploring Parameter Space in Reinforcement Learning

  • Thomas Rückstieß,
  • Frank Sehnke,
  • Tom Schaul,
  • Daan Wierstra,
  • Yi Sun,
  • Jürgen Schmidhuber

DOI
https://doi.org/10.2478/s13230-010-0002-4
Journal volume & issue
Vol. 1, no. 1
pp. 14–24

Abstract

This paper discusses parameter-based exploration methods for reinforcement learning. Parameter-based methods perturb the parameters of a general function approximator directly, rather than adding noise to the resulting actions. Parameter-based exploration unifies reinforcement learning and black-box optimization, and has several advantages over action perturbation. We review two recent parameter-exploring algorithms: Natural Evolution Strategies and Policy Gradients with Parameter-Based Exploration. Both outperform state-of-the-art algorithms in several complex high-dimensional tasks commonly found in robot control. Furthermore, we describe how a novel exploration method, State-Dependent Exploration, can modify existing algorithms to mimic exploration in parameter space.
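To make the distinction the abstract draws concrete, the minimal Python sketch below contrasts action perturbation with the parameter perturbation that methods such as NES and PGPE build on. The listing is not from the paper; the linear policy, dimensions, and noise scales are illustrative assumptions.

    import numpy as np

    rng = np.random.default_rng(0)

    def linear_policy(theta, state):
        # Deterministic linear policy: action = theta @ state.
        return theta @ state

    theta = rng.normal(size=(2, 4))  # policy parameters: 2 actions, 4 state dims
    state = rng.normal(size=4)       # an observed state

    # Action perturbation: noise is injected into the action at every time step,
    # so the same state can yield a different action on every visit.
    action_noisy = linear_policy(theta, state) + rng.normal(scale=0.1, size=2)

    # Parameter perturbation: the parameters are perturbed once (e.g. per episode)
    # and the perturbed policy is then followed deterministically.
    theta_perturbed = theta + rng.normal(scale=0.1, size=theta.shape)
    action_consistent = linear_policy(theta_perturbed, state)

Because the perturbed parameters are held fixed over a whole episode, exploration is temporally consistent along the trajectory rather than washed out by independent per-step noise, which is one of the advantages over action perturbation the abstract alludes to.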

Keywords