Implementation Issues for Markov Decision Processes.

dc.contributor.author	Makowski, Armand M.	en_US
dc.contributor.author	Shwartz, A.	en_US
dc.contributor.department	ISR	en_US
dc.date.accessioned	2007-05-23T09:35:49Z
dc.date.available	2007-05-23T09:35:49Z
dc.date.issued	1986	en_US
dc.description.abstract	In this paper, the problem of steering a long-run average coat functional to a prespecified value is discussed in the context of Markov decision processes wish countable statespace; this problem naturally arises in the study of constrained Markov decision processes by Lagrangian arguments. Under reasonable assumptions, a Markov stationary steering control is shown to exist, and to be obtained by fixed memoryless randomization between two Markov stationary policies. The implementability of this randomized policy is investigated in view of the fact that the randomization bias is solution to a (highly) nonlinear equation, which may not even be available in the absence of full knowledge of the model parameter values. Several proposals for implementation are made and their relative properties discussed. The paper closes with an outline of a methodology that was found useful in investigating properties of Certainty Equivalence implementations.	en_US
dc.format.extent	835877 bytes
dc.format.mimetype	application/pdf
dc.identifier.uri	http://hdl.handle.net/1903/4488
dc.language.iso	en_US	en_US
dc.relation.ispartofseries	ISR; TR 1986-63	en_US
dc.title	Implementation Issues for Markov Decision Processes.	en_US
dc.type	Technical Report	en_US

Files

Now showing 1 - 1 of 1