Upper Bounds on the Performance of Discretisation in Reinforcement Learning

Michael Robin Mitchley

doi:10.18489/sacj.v0i57.284

South African Computer Journal (Dec 2015)

Affiliations

Michael Robin Mitchley: School of Computer Science and Applied Mathematics, University of the Witwatersrand, Johannesburg

DOI: https://doi.org/10.18489/sacj.v0i57.284
Journal volume & issue: Vol. 0, no. 57

Abstract

Reinforcement learning is a machine learning framework in which an agent learns to perform a task by maximising the total reward it receives for the actions it selects in each state. The policy the agent learns, a mapping from states to actions, is represented either explicitly or implicitly through a value function. It is common in reinforcement learning to discretise a continuous state space using tile coding or binary features. We prove an upper bound on the performance of discretisation for direct policy representation or value function approximation.
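
To make the discretisation step concrete, the following is a minimal sketch of tile coding that maps a continuous state to binary features. It is an illustration only and is not taken from the paper: the function names (tile_indices, binary_features), the uniform offset scheme, and the parameters tiles_per_dim and num_tilings are all assumptions chosen for the example.

```python
import math


def tile_indices(state, lows, highs, tiles_per_dim, num_tilings):
    """Return one active tile index per tiling for a continuous state.

    Hypothetical helper: each tiling is a uniform grid, shifted by a
    fraction of one tile width relative to the previous tiling.
    """
    # Each dimension gets one extra tile so that shifted tilings still
    # cover the upper edge of the range.
    tiles_per_tiling = (tiles_per_dim + 1) ** len(state)
    active = []
    for t in range(num_tilings):
        index = 0
        for x, lo, hi in zip(state, lows, highs):
            width = (hi - lo) / tiles_per_dim
            offset = t * width / num_tilings
            coord = int(math.floor((x - lo + offset) / width))
            # Clamp so that out-of-range states map to a boundary tile.
            coord = min(max(coord, 0), tiles_per_dim)
            index = index * (tiles_per_dim + 1) + coord
        active.append(t * tiles_per_tiling + index)
    return active


def binary_features(state, lows, highs, tiles_per_dim, num_tilings):
    """Return a 0/1 feature vector with one active entry per tiling."""
    size = num_tilings * (tiles_per_dim + 1) ** len(state)
    features = [0] * size
    for i in tile_indices(state, lows, highs, tiles_per_dim, num_tilings):
        features[i] = 1
    return features


# Two nearby states that fall in the same tile of every tiling receive
# identical features, so any policy or value function built on these
# features must treat them identically.
a = binary_features((0.51,), (0.0,), (1.0,), tiles_per_dim=4, num_tilings=2)
b = binary_features((0.55,), (0.0,), (1.0,), tiles_per_dim=4, num_tilings=2)
```

In this sketch, states 0.51 and 0.55 are indistinguishable after discretisation. This kind of lost resolution is the general reason a discretised representation can limit achievable performance; the sketch does not reproduce the bound proved in the paper.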

Published in South African Computer Journal

ISSN: 1015-7999 (Print); 2313-7835 (Online)
Publisher: South African Institute of Computer Scientists and Information Technologists
Country of publisher: South Africa
LCC subjects: Technology: Technology (General): Industrial engineering. Management engineering: Management information systems; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: http://sacj.cs.uct.ac.za/
