Approximate Policy Iteration for Semiconductor Fab-Level Decision Making - a Case Study
dc.contributor.advisor | Marcus, Steven I. | en_US |
dc.contributor.author | He, Ying | en_US |
dc.contributor.author | Bhatnagar, Shalabh | en_US |
dc.contributor.author | Fu, Michael C. | en_US |
dc.contributor.author | Marcus, Steven I. | en_US |
dc.contributor.department | ISR | en_US |
dc.date.accessioned | 2007-05-23T10:09:32Z | |
dc.date.available | 2007-05-23T10:09:32Z | |
dc.date.issued | 2000 | en_US |
dc.description.abstract | In this paper, we propose an approximate policy iteration (API) algorithm for asemiconductor fab-level decision making problem. This problem is formulated as adiscounted cost Markov Decision Process (MDP), and we have applied exact policy iterationto solve a simple example in prior work. However, the overwhelmingcomputational requirements of exact policy iteration prevent its application forlarger problems. Approximate policy iteration overcomes this obstacle by approximating thecost-to-go using function approximation. Numerical simulation on the same example showsthat the proposed API algorithm leads to a policy with cost close to that of the optimalpolicy. | en_US |
dc.format.extent | 283099 bytes | |
dc.format.mimetype | application/pdf | |
dc.identifier.uri | http://hdl.handle.net/1903/6142 | |
dc.language.iso | en_US | en_US |
dc.relation.ispartofseries | ISR; TR 2000-49 | en_US |
dc.subject | Approximate Policy Iteration | en_US |
dc.subject | Semiconductor Fab-Level Decision Making | en_US |
dc.subject | Markov Decision Processes | en_US |
dc.subject | Discounted Cost Problem | en_US |
dc.subject | Sensor-Actuator Networks | en_US |
dc.subject | Next-Generation Product Realization Systems | en_US |
dc.title | Approximate Policy Iteration for Semiconductor Fab-Level Decision Making - a Case Study | en_US |
dc.type | Technical Report | en_US |
Files
Original bundle
1 - 1 of 1