A Fresh Look at Markov Decision Processes.

dc.contributor.author: Borkar, Vivek S. [en_US]
dc.contributor.department: ISR [en_US]
dc.date.accessioned: 2007-05-23T09:36:40Z
dc.date.available: 2007-05-23T09:36:40Z
dc.date.issued: 1987 [en_US]
dc.description.abstract: This paper develops a new framework for the study of Markov decision processes in which the control problem is viewed as an optimization problem on the set of canonically induced measures on the trajectory space of the joint state and control process. This set is shown to be compact convex. One then associates with each of the usual cost criteria (infinite horizon discounted cost, finite horizon, control up to an exit time) a naturally defined occupation measure such that the cost is an integral of some function with respect to this measure. These measures are shown to form a compact convex set whose extreme points are characterized. Classical results about existence of optimal strategies are recovered from this and several applications to multicriteria and constrained optimization problems are briefly indicated. [en_US]
dc.format.extent: 908129 bytes
dc.format.mimetype: application/pdf
dc.identifier.uri: http://hdl.handle.net/1903/4535
dc.language.iso: en_US [en_US]
dc.relation.ispartofseries: ISR; TR 1987-26 [en_US]
dc.title: A Fresh Look at Markov Decision Processes. [en_US]
dc.type: Technical Report [en_US]

Files

Original bundle

Name: TR_87-26.pdf
Size: 886.84 KB
Format: Adobe Portable Document Format