Optimal Risk Sensitive Control of Semi-Markov Decision Processes

Chawla, Jay P.

Optimal Risk Sensitive Control of Semi-Markov Decision Processes

dc.contributor.advisor	Marcus, Steven I.	en_US
dc.contributor.advisor	Shayman, Mark A.	en_US
dc.contributor.author	Chawla, Jay P.	en_US
dc.contributor.department	ISR	en_US
dc.date.accessioned	2007-05-23T10:09:43Z
dc.date.available	2007-05-23T10:09:43Z
dc.date.issued	2000	en_US
dc.description.abstract	In this thesis, we study risk-sensitive cost minimization in semi-Markov decision processes. The main thrust of the thesis concerns the minimization of average risk sensitive costs over the infinite horizon. <p>Existing theory is expanded in two directions: the semi-Markov case is considered, and non-irreducible chains are considered. In particular, the analysis of the non-irreducible case is a significant addition to the literature, since many real-world systems do not exhibit irreducibility under all stationary Markov policies. Extension of existing results to the semi-Markov case is significant because it requires the definition of a new dynamic programming equation and a technically challenging adaptation of the Perron-Frobeniuseigen value from the discrete time case.<p>In order to determine an optimal policy, new concepts in the classification of Markov chains need to be introduced. This is because in the non-irreducible case, the average risk sensitive cost objective function permits extremely unlikely events to exert a controlling influence on costs. We define equivalence classes of states called 'strongly communicating classes' and formulate in terms of them a new characterization of the underlying structure of Markov Decision Problems and Markov chains.<p>In the risk sensitive case, the expected cost incurred prior to a stopping time with finite expected value can be infinite. For this reason, we introduce an assumption: reachability with finite cost. This is the fundamental assumption required to achieve the major results of this thesis.<p>We explore existence conditions for an optimal policy, optimality equations, and behavior for large and small risk sensitivity parameter. (Only non-negative risk parameters are discussed in this thesis -- i.e. the risk averse and risk neutral cases, not the risk seeking case.) Ramifications for the risk neutral objective function are also analyzed. Furthermore, a simple solution technique we call 'recursive computation' to find an optimal policy that is applicable to small state spaces is described through examples.<p>The countable state space case is explored, and results that hold only for a finite state space are also presented. Other, related objective functions such as sample path cost are analyzed and discussed.<p>We also explore finite time horizon semi-Markov problems, and present a general technique for solving them. We define a new objective function, the minimization of which is called the 'deadline problem'. This is a problem in which the probability of reaching the goal state in a set period of time is maximized. We transform the deadline problem objective function into an equivalent finite-horizon risk sensitive objective function.	en_US
dc.format.extent	1134429 bytes
dc.format.mimetype	application/pdf
dc.identifier.uri	http://hdl.handle.net/1903/6150
dc.language.iso	en_US	en_US
dc.relation.ispartofseries	ISR; PhD 2000-8	en_US
dc.subject	queueing networks	en_US
dc.subject	optimal control	en_US
dc.subject	discrete event dynamical systems DEDS	en_US
dc.subject	risk sensitive	en_US
dc.subject	semi-Markov	en_US
dc.subject	MDP	en_US
dc.subject	Markov decision process	en_US
dc.subject	optimal control	en_US
dc.subject	stochastic control	en_US
dc.subject	Perron-Frobenius	en_US
dc.subject	reducible	en_US
dc.subject	Markov	en_US
dc.subject	realization	en_US
dc.subject	Global Communication Systems	en_US
dc.title	Optimal Risk Sensitive Control of Semi-Markov Decision Processes	en_US
dc.type	Dissertation	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: PhD_2000-8.pdf
Size:: 1.08 MB
Format:: Adobe Portable Document Format

Download

Collections

Institute for Systems Research Technical Reports