Search
Now showing items 1-10 of 16
Approximate Receding Horizon Approach for Markov Decision Processes: Average Reward Case
(2001)
Building on the receding horizon approach by Hernandez-Lerma andLasserre in solving Markov decision processes (MDPs),this paper first analyzes the performance of the (approximate) receding horizon approach in terms of ...
Simulation-Based Approach for Semiconductor Fab-Level Decision Making - Implementation Issues
(2000)
In this paper, we discuss implementation issues of applying a simulation-based approach to asemiconductor fab-level decision making problem. The fab-level decision making problem isformulated as a Markov Decision Process ...
Stability of Wireless Networks for Mode S Radar
(2000)
Stability issues in a connectionless, one-hop queueing system featuringservers with overlapping service regions (e.g. a Mode Select (Mode S) Radarcommunications network or part of an Aeronautical Telecommunications Network ...
Randomized Difference Two-Timescale Simultaneous Perturbation Stochastic Approximation Algorithms for Simulation Optimization of Hidden Markov Models
(2000)
We proposetwo finite difference two-timescale simultaneous perturbationstochastic approximation (SPSA)algorithmsfor simulation optimization ofhidden Markov models. Stability and convergence of both thealgorithms is ...
An Asymptotically Efficient Algorithm for Finite Horizon Stochastic Dynamic Programming Problems
(2003)
We present a novel algorithm, called ``Simulated Annealing Multiplicative Weights", for approximately solving large finite-horizon stochastic dynamic programming problems. The algorithm is ``asymptotically efficient" in ...
Solving Continuous-State POMDPs via Density Projection
(2007)
Research on numerical solution methods for partially observable Markov decision processes (POMDPs) has primarily focused on discrete-state models, and these algorithms do not generally extend to continuous-state POMDPs, ...
Multi-time Scale Markov Decision Processes
(2002)
This paper proposes a simple analytical model called M time-scale MarkovDecision Process (MMDP) for hierarchically structured sequential decision making processes, where decisions in each level in the M-level hierarchy are ...
Markov Games: Receding Horizon Approach
(2001)
We consider a receding horizon approach as an approximate solution totwo-person zero-sum Markov games with infinite horizon discounted costand average cost criteria. <p>We first present error bounds from the optimalequilibrium ...
Coloring Rooted Subtrees on a Bounded Degree Host Tree
(2007)
We consider a rooted tree R to be a rooted subtree of a given tree T if the tree obtained by replacing the directed arcs of R by undirected edges is a subtree of T.
In this work, we study the problem of assigning colors ...