A New Adaptive Aggregation Algorithm for Infinite Horizon Dynamic Programming

Zhang, Chang; Baras, John S.

A New Adaptive Aggregation Algorithm for Infinite Horizon Dynamic Programming

Files

TR_2001-12.pdf (469.94 KB)

No. of downloads: 569

Date

2001

Authors

Zhang, Chang

Baras, John S.

Abstract

Dynamic programming suffers the "curse of dimensionality" when it isemployed for complex control systems. State aggregation is used to solvethe problem and acceleratecomputation by looking for a sub-optimal policy. In this paper, a new method, which converges much faster thanconventional aggregated value iteration based on TD(0), is proposed for computing the valuefunctions of theaggregated system. Preliminary results show that the new method increases thespeed of convergence impressively. Aggregation introduces errorsinevitably. An adaptive aggregation scheme employing the newcomputation method isalso proposed to reduce the aggregation errors.

URI (handle)

http://hdl.handle.net/1903/6228

Collections

Institute for Systems Research Technical Reports

Full item page