Approximate Receding Horizon Approach for Markov Decision Processes: Average Reward Case

Chang, Hyeong Soo; Marcus, Steven I.

Approximate Receding Horizon Approach for Markov Decision Processes: Average Reward Case

Files

TR_2001-46.pdf (183.84 KB)

No. of downloads: 573

Date

2001

Authors

Chang, Hyeong Soo

Marcus, Steven I.

Abstract

Building on the receding horizon approach by Hernandez-Lerma andLasserre in solving Markov decision processes (MDPs),this paper first analyzes the performance of the (approximate) receding horizon approach in terms of infinite horizon average reward.

In this approach, we fix a finite horizon and at each decision time, we solve the given MDP with the finite horizon for an approximately optimal current action and take the action to control the MDP.

We then analyze recently proposed on-line policy improvementscheme, "roll-out," by Bertsekas and Castanon, and a generalization of the rollout algorithm, "parallel rollout" by Chang et al., in terms of the infinite horizon average reward in the framework of the (approximate) receding horizon control.

We finally discuss practical implementations of these schemes via simulation.

URI (handle)

http://hdl.handle.net/1903/6208

Collections

Institute for Systems Research Technical Reports

Full item page