An Asymptotically Efficient Algorithm for Finite Horizon Stochastic Dynamic Programming Problems

dc.contributor.advisor	Fu, Michael C.	en_US
dc.contributor.advisor	Marcus, Steven I.	en_US
dc.contributor.author	Chang, Hyeong Soo	en_US
dc.contributor.author	Fu, Michael C.	en_US
dc.contributor.author	Marcus, Steven I.	en_US
dc.contributor.department	ISR	en_US
dc.date.accessioned	2007-05-23T10:13:47Z
dc.date.available	2007-05-23T10:13:47Z
dc.date.issued	2003	en_US
dc.description.abstract	We present a novel algorithm, called ``Simulated Annealing Multiplicative Weights", for approximately solving large finite-horizon stochastic dynamic programming problems. The algorithm is ``asymptotically efficient" in the sense that a finite-time bound for the sample mean of the optimal value function over a given finite policy space can be obtained, and the bound approaches the optimal value as the number of iterations increases.The algorithm updates a probability distribution over the given policy space with a very simple rule, and the sequence of distributions generated by the algorithm converges to a distribution concentrated only on the optimal policies for the given policy space. We also discuss how to reduce the computational cost of the algorithm to apply it in practice.	en_US
dc.format.extent	259013 bytes
dc.format.mimetype	application/pdf
dc.identifier.uri	http://hdl.handle.net/1903/6358
dc.language.iso	en_US	en_US
dc.relation.ispartofseries	ISR; TR 2003-26	en_US
dc.subject	NULL	en_US
dc.title	An Asymptotically Efficient Algorithm for Finite Horizon Stochastic Dynamic Programming Problems	en_US
dc.type	Technical Report	en_US

Files

Now showing 1 - 1 of 1