Optimization in Markov decision problems with transition-dependent cost functions

Junjie Wang*, Xi Ren Cao

*Corresponding author for this work

Research output: Contribution to conferenceConference Paperpeer-review

Abstract

The traditional MDP deals with the cost function which only depends on the state, and the corresponding action. In the real world, however, there are many applications where the cost incurred depends on the particular transition as well, which makes the traditional MDP solution infeasible for these problems. In this paper, we apply the performance potential theory as an optimization tool for MDP. In particular, the notion of the expanded Markov chain is introduced to map this problem to a general form. Both computation-based and sample-path-based algorithms are developed for potential derivation. We address ourselves to the complexity-reduction techniques. Finally, we apply these techniques to the `Join the Shortest Queue' application, which is a significant component in the analysis of communication system.

Original languageEnglish
Pages1478-1483
Number of pages6
Publication statusPublished - 1999
Externally publishedYes
EventProceedings of the 1999 IEEE International Conference on Control Applications (CCA) and IEEE International Symposium on Computer Aided Control System Design (CACSD) - Kohala Coast, HI, USA
Duration: 22 Aug 199927 Aug 1999

Conference

ConferenceProceedings of the 1999 IEEE International Conference on Control Applications (CCA) and IEEE International Symposium on Computer Aided Control System Design (CACSD)
CityKohala Coast, HI, USA
Period22/08/9927/08/99

Fingerprint

Dive into the research topics of 'Optimization in Markov decision problems with transition-dependent cost functions'. Together they form a unique fingerprint.

Cite this