Convergence of Sample Path Optimal Policies for Stochastic Dynamic Programming

dc.contributor.advisor: Fu, Michael C.
dc.contributor.author: Fu, Michael C.
dc.contributor.author: Jin, Xing
dc.contributor.department: ISR
dc.date.accessioned: 2007-05-23T10:17:46Z
dc.date.available: 2007-05-23T10:17:46Z
dc.date.issued: 2005
dc.description.abstract: We consider the solution of stochastic dynamic programs using sample path estimates. Applying the theory of large deviations, we derive probability error bounds associated with the convergence of the estimated optimal policy to the true optimal policy, for finite horizon problems. These bounds decay at an exponential rate, in contrast with the usual canonical (inverse) square root rate associated with estimation of the value (cost-to-go) function itself. These results have practical implications for Monte Carlo simulation-based solution approaches to stochastic dynamic programming problems where it is impractical to extract the explicit transition probabilities of the underlying system model.
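To make the abstract's setting concrete, below is a minimal illustrative sketch, not taken from the report: the toy model, function names, and sample sizes are all hypothetical. It shows the sample-path approach the abstract describes, backward induction on a finite-horizon problem in which the expected cost-to-go is replaced by a Monte Carlo sample average drawn from a black-box simulator, so the solver never sees the transition probabilities explicitly. The report's result concerns bounds of the generic large-deviations form P(estimated policy differs from optimal) <= C exp(-beta n), which decay in the sample size n faster than the O(1/sqrt(n)) error of the value estimates themselves.

```python
import numpy as np

# Hypothetical toy model: finite horizon T, small state/action spaces.
# The true transition kernel P[s, a, s'] is used only inside the
# simulator; the solver sees it solely through sampled next states,
# as in the sample-path setting the abstract describes.

rng = np.random.default_rng(0)
T, S, A = 4, 5, 3                             # horizon, #states, #actions
P = rng.dirichlet(np.ones(S), size=(S, A))    # true (hidden) kernel
cost = rng.uniform(0.0, 1.0, size=(S, A))     # one-stage costs

def simulate_next_state(s, a):
    """Black-box simulator: draws s' ~ P[s, a, .]."""
    return rng.choice(S, p=P[s, a])

def sample_path_policy(n_samples):
    """Backward induction using sample-average estimates of the
    expected cost-to-go in place of the exact expectation."""
    V = np.zeros(S)                        # terminal cost-to-go = 0
    policy = np.zeros((T, S), dtype=int)
    for t in reversed(range(T)):
        Q = np.empty((S, A))
        for s in range(S):
            for a in range(A):
                # Monte Carlo estimate of E[V(s') | s, a]
                draws = [V[simulate_next_state(s, a)] for _ in range(n_samples)]
                Q[s, a] = cost[s, a] + np.mean(draws)
        policy[t] = Q.argmin(axis=1)       # estimated optimal action per state
        V = Q.min(axis=1)                  # estimated cost-to-go at stage t
    return policy

pi_hat = sample_path_policy(n_samples=200)
print(pi_hat)
```

Rerunning sample_path_policy with increasing n_samples and comparing against the policy computed by exact backward induction on the true kernel would let one observe the phenomenon the abstract states: disagreements between the estimated and true optimal policies vanish much faster than the error in the value estimates shrinks.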
dc.format.extent: 186156 bytes
dc.format.mimetype: application/pdf
dc.identifier.uri: http://hdl.handle.net/1903/6548
dc.language.iso: en_US
dc.relation.ispartofseries: ISR; TR 2005-84
dc.title: Convergence of Sample Path Optimal Policies for Stochastic Dynamic Programming
dc.type: Technical Report

Files

Original bundle (1 of 1):
Name: TR_2005-84.pdf
Size: 181.79 KB
Format: Adobe Portable Document Format