Mathematics (Jun 2022)

Optimal Control with Partially Observed Regime Switching: Discounted and Average Payoffs

  • Beatris Adriana Escobedo-Trujillo,
  • Javier Garrido-Meléndez,
  • Gerardo Alcalá,
  • J. D. Revuelta-Acosta

DOI
https://doi.org/10.3390/math10122073
Journal volume & issue
Vol. 10, no. 12
p. 2073

Abstract

Read online

We consider an optimal control problem with the discounted and average payoff. The reward rate (or cost rate) can be unbounded from above and below, and a Markovian switching stochastic differential equation gives the state variable dynamic. Markovian switching is represented by a hidden continuous-time Markov chain that can only be observed in Gaussian white noise. Our general aim is to give conditions for the existence of optimal Markov stationary controls. This fact generalizes the conditions that ensure the existence of optimal control policies for optimal control problems completely observed. We use standard dynamic programming techniques and the method of hidden Markov model filtering to achieve our goals. As applications of our results, we study the discounted linear quadratic regulator (LQR) problem, the ergodic LQR problem for the modeled quarter-car suspension, the average LQR problem for the modeled quarter-car suspension with damp, and an explicit application for an optimal pollution control.

Keywords