Semi-Markov decision problems and performance sensitivity analysis

Xi Ren Cao*

*Corresponding author for this work

Research output: Chapter in Book/Conference Proceeding/ReportConference Paper published in a bookpeer-review

2 Citations (Scopus)

Abstract

We extend the results about performance potentials, perturbation realization matrices, policy iteration of Markov decision processes, etc., to semi-Markov processes (SMPs). Starting with the concept of perturbation realization, we define a realization matrix and prove that it satisfies the Lyapunov equation. From the realization matrix we define a performance potential and prove that it satisfies the Poisson equation. Sensitivity formulas and policy iteration algorithms of Semi-Markov decision process (SMDPs) can be derived. The performance sensitivities can be obtained and policy iteration of SMDPs can be implemeted on a single sample path of the SMPs.

Original languageEnglish
Title of host publicationIFAC Proceedings Volumes (IFAC-PapersOnline)
EditorsGabriel Ferrate, Eduardo F. Camacho, Luis Basanez, Juan. A. de la Puente
PublisherIFAC Secretariat
Pages139-143
Number of pages5
Edition1
ISBN (Print)9783902661746
DOIs
Publication statusPublished - 2002
Event15th World Congress of the International Federation of Automatic Control, 2002 - Barcelona, Spain
Duration: 21 Jul 200226 Jul 2002

Publication series

NameIFAC Proceedings Volumes (IFAC-PapersOnline)
Number1
Volume15
ISSN (Print)1474-6670

Conference

Conference15th World Congress of the International Federation of Automatic Control, 2002
Country/TerritorySpain
CityBarcelona
Period21/07/0226/07/02

Bibliographical note

Publisher Copyright:
Copyright © 2002 IFAC.

Keywords

  • Lyapunov equations
  • Perturbation analysis
  • Poisson equations
  • Policy iteration
  • Potentials

Fingerprint

Dive into the research topics of 'Semi-Markov decision problems and performance sensitivity analysis'. Together they form a unique fingerprint.

Cite this