A Version of the Euler Equation in Discounted Markov Decision Processes

H. Cruz-Suárez; G. Zacarías-Espinoza; V. Vázquez-Guevara

doi:10.1155/2012/103698

Journal of Applied Mathematics (Jan 2012)

A Version of the Euler Equation in Discounted Markov Decision Processes

H. Cruz-Suárez,
G. Zacarías-Espinoza,
V. Vázquez-Guevara

Affiliations

H. Cruz-Suárez: Facultad de Ciencias Físico Matemáticas, Benemérita Universidad Autónoma de Puebla, Avenida San Claudio y Río Verde, Col. San Manuel, CU, 72570 Puebla, PUE, Mexico
G. Zacarías-Espinoza: Facultad de Ciencias Físico Matemáticas, Benemérita Universidad Autónoma de Puebla, Avenida San Claudio y Río Verde, Col. San Manuel, CU, 72570 Puebla, PUE, Mexico
V. Vázquez-Guevara: Facultad de Ciencias Físico Matemáticas, Benemérita Universidad Autónoma de Puebla, Avenida San Claudio y Río Verde, Col. San Manuel, CU, 72570 Puebla, PUE, Mexico

DOI: https://doi.org/10.1155/2012/103698
Journal volume & issue: Vol. 2012

Abstract

Read online

This paper deals with Markov decision processes (MDPs) on Euclidean spaces with an infinite horizon. An approach to study this kind of MDPs is using the dynamic programming technique (DP). Then the optimal value function is characterized through the value iteration functions. The paper provides conditions that guarantee the convergence of maximizers of the value iteration functions to the optimal policy. Then, using the Euler equation and an envelope formula, the optimal solution of the optimal control problem is obtained. Finally, this theory is applied to a linear-quadratic control problem in order to find its optimal policy.

Published in Journal of Applied Mathematics

ISSN: 1110-757X (Print); 1687-0042 (Online)
Publisher: Hindawi Limited
Country of publisher: United Kingdom
LCC subjects: Science: Mathematics
Website: https://onlinelibrary.wiley.com/journal/4185

About the journal