Deterministic Discounted Markov Decision Processes with Fuzzy Rewards/Costs

Hugo Cruz-Suárez; Raúl Montes-de-Oca; R. Israel Ortega-Gutiérrez

doi:10.26599/FIE.2023.9270020

Fuzzy Information and Engineering (Sep 2023)

Deterministic Discounted Markov Decision Processes with Fuzzy Rewards/Costs

Hugo Cruz-Suárez,
Raúl Montes-de-Oca,
R. Israel Ortega-Gutiérrez

Affiliations

Hugo Cruz-Suárez: Facultad de Ciencias Físico Matemáticas, Benemérita Universidad Autónoma de Puebla, Puebla 72570, México
Raúl Montes-de-Oca: Departamento de Matemáticas, Universidad Autónoma Metropolitana-Iztapalapa, CDMX 09340, México
R. Israel Ortega-Gutiérrez: Facultad de Ciencias Físico Matemáticas, Benemérita Universidad Autónoma de Puebla, Puebla 72570, México

DOI: https://doi.org/10.26599/FIE.2023.9270020
Journal volume & issue: Vol. 15, no. 3
pp. 274 – 290

Abstract

Read online

The article concerns a study of infinite-horizon deterministic Markov decision processes (MDPs) for which the fuzzy environment will be presented through considering these MDPs with both fuzzy rewards and fuzzy costs. Specifically, these rewards and costs will be assumed of a suitable trapezoidal type. For both classes of MDPs, i.e., MDPs with fuzzy rewards and MDPs with fuzzy costs, the fuzzy total discounted function will be taken into account as the objective function, and the corresponding optimal decision problems will be considered with respect to the max order of the fuzzy numbers. For each optimal decision problem, the optimal policy and the optimal value function are related and obtained as a solution of a convenient standard MDP (i.e., a standard MDP is an MDP with a non-fuzzy reward function or a non-fuzzy cost function). Moreover, an economic growth model (EGM), a deterministic version of the linear-quadratic model (LQM), and an optimal consumption model (OCM) in order to clarify the theory presented are given, and it is remarked that these models have uncountable state spaces, and the corresponding non-fuzzy version of both the EGM and the OCM has an unbounded reward function, and the corresponding non-fuzzy version of the LQM has an unbounded cost function.

Published in Fuzzy Information and Engineering

ISSN: 1616-8658 (Print); 1616-8666 (Online)
Publisher: Tsinghua University Press
Country of publisher: China
LCC subjects: Technology: Engineering (General). Civil engineering (General); Science: Mathematics
Website: https://www.sciopen.com/journal/1616-8658

About the journal

Abstract

Keywords