Finding the Value of Information About a State Variable in a Markov Decision Process

Loading...
Thumbnail Image

Files

MDP_paper_v5.pdf (668.43 KB)
No. of downloads: 990

Publication or External Link

Date

2005-01-13T15:04:40Z

Advisor

Citation

DRUM DOI

Abstract

In this paper we present a mixed–integer programming formulation that computes the optimal solution for a certain class of Markov decision processes with finite state and action spaces, where a state is comprised of multiple state variables, and one of the state variables is unobservable to the decision maker. Our approach is a much simpler modeling alternative to the theory of partially observable Markov decision processes (POMDP), where an information and updating structure about the decision variable needs to be defined. We illustrate the approach with an example of a duopoly where one firm’s actions are not immediately observable by the other firm, and present computational results. We believe that this approach can be used in a variety of applications, where the decision maker wants to assess the value of information about an additional decision variable.

Notes

Rights