Abstract
We extend the results about performance potentials, perturbation realization matrices, policy iteration of Markov decision processes, etc., to semi-Markov processes (SMPs). Starting with the concept of perturbation realization, we define a realization matrix and prove that it satisfies the Lyapunov equation. From the realization matrix we define a performance potential and prove that it satisfies the Poisson equation. Sensitivity formulas and policy iteration algorithms of Semi-Markov decision process (SMDPs) can be derived. The performance sensitivities can be obtained and policy iteration of SMDPs can be implemeted on a single sample path of the SMPs.
| Original language | English |
|---|---|
| Title of host publication | IFAC Proceedings Volumes (IFAC-PapersOnline) |
| Editors | Gabriel Ferrate, Eduardo F. Camacho, Luis Basanez, Juan. A. de la Puente |
| Publisher | IFAC Secretariat |
| Pages | 139-143 |
| Number of pages | 5 |
| Edition | 1 |
| ISBN (Print) | 9783902661746 |
| DOIs | |
| Publication status | Published - 2002 |
| Event | 15th World Congress of the International Federation of Automatic Control, 2002 - Barcelona, Spain Duration: 21 Jul 2002 → 26 Jul 2002 |
Publication series
| Name | IFAC Proceedings Volumes (IFAC-PapersOnline) |
|---|---|
| Number | 1 |
| Volume | 15 |
| ISSN (Print) | 1474-6670 |
Conference
| Conference | 15th World Congress of the International Federation of Automatic Control, 2002 |
|---|---|
| Country/Territory | Spain |
| City | Barcelona |
| Period | 21/07/02 → 26/07/02 |
Bibliographical note
Publisher Copyright:Copyright © 2002 IFAC.
Keywords
- Lyapunov equations
- Perturbation analysis
- Poisson equations
- Policy iteration
- Potentials
Fingerprint
Dive into the research topics of 'Semi-Markov decision problems and performance sensitivity analysis'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver