Search
Now showing items 21-30 of 45
Risk-Sensitive Optimal Control of Hidden Markov Models: Structural Results
(1996)
We consider a risk-sensitive optimal control problem for hidden Markov models (HMM), i.e. controlled Markov chains where state information is only available to the controller via an output (message) process. Building upon ...
An Asymptotically Efficient Algorithm for Finite Horizon Stochastic Dynamic Programming Problems
(2003)
We present a novel algorithm, called ``Simulated Annealing Multiplicative Weights", for approximately solving large finite-horizon stochastic dynamic programming problems. The algorithm is ``asymptotically efficient" in ...
Solving Continuous-State POMDPs via Density Projection
(2007)
Research on numerical solution methods for partially observable Markov decision processes (POMDPs) has primarily focused on discrete-state models, and these algorithms do not generally extend to continuous-state POMDPs, ...
Multi-time Scale Markov Decision Processes
(2002)
This paper proposes a simple analytical model called M time-scale MarkovDecision Process (MMDP) for hierarchically structured sequential decision making processes, where decisions in each level in the M-level hierarchy are ...
Markov Games: Receding Horizon Approach
(2001)
We consider a receding horizon approach as an approximate solution totwo-person zero-sum Markov games with infinite horizon discounted costand average cost criteria. <p>We first present error bounds from the optimalequilibrium ...
Coloring Rooted Subtrees on a Bounded Degree Host Tree
(2007)
We consider a rooted tree R to be a rooted subtree of a given tree T if the tree obtained by replacing the directed arcs of R by undirected edges is a subtree of T.
In this work, we study the problem of assigning colors ...
Probabilistic Language Framework for Stochastic Discrete Event Systems
(1996)
We introduce the notion of probabilistic languages to describe the qualitative behavior of stochastic discrete event systems. Regular language operators such as choice, concatenation, and Kleene-closure have been defined ...
Estimation of Hidden Markov Models
(2001)
A risk-sensitive generalization of the Maximum A Posterior Probability (MAP) estimationfor partially observed Markov chains is presented.Using a change of measure technique,a cascade filtering scheme for the risk-sensitivestate ...
Risk-Sensitive Probability for Markov Chains
(2002)
The probability distribution of a Markov chain is viewed as the information state of an additive optimization problem. This optimization problem is then generalized to a product form whose information state gives rise to ...