An Asymptotically Efficient Algorithm for Finite Horizon Stochastic Dynamic Programming Problems
dc.contributor.advisor | Fu, Michael C. | en_US |
dc.contributor.advisor | Marcus, Steven I. | en_US |
dc.contributor.author | Chang, Hyeong Soo | en_US |
dc.contributor.author | Fu, Michael C. | en_US |
dc.contributor.author | Marcus, Steven I. | en_US |
dc.contributor.department | ISR | en_US |
dc.date.accessioned | 2007-05-23T10:13:47Z | |
dc.date.available | 2007-05-23T10:13:47Z | |
dc.date.issued | 2003 | en_US |
dc.description.abstract | We present a novel algorithm, called ``Simulated Annealing Multiplicative Weights", for approximately solving large finite-horizon stochastic dynamic programming problems. The algorithm is ``asymptotically efficient" in the sense that a finite-time bound for the sample mean of the optimal value function over a given finite policy space can be obtained, and the bound approaches the optimal value as the number of iterations increases.The algorithm updates a probability distribution over the given policy space with a very simple rule, and the sequence of distributions generated by the algorithm converges to a distribution concentrated only on the optimal policies for the given policy space. We also discuss how to reduce the computational cost of the algorithm to apply it in practice. | en_US |
dc.format.extent | 259013 bytes | |
dc.format.mimetype | application/pdf | |
dc.identifier.uri | http://hdl.handle.net/1903/6358 | |
dc.language.iso | en_US | en_US |
dc.relation.ispartofseries | ISR; TR 2003-26 | en_US |
dc.subject | NULL | en_US |
dc.title | An Asymptotically Efficient Algorithm for Finite Horizon Stochastic Dynamic Programming Problems | en_US |
dc.type | Technical Report | en_US |
Files
Original bundle
1 - 1 of 1