SEQUENTIAL DECISION MAKING WITH LIMITED RESOURCES

Sankararaman, Karthik Abinav

SEQUENTIAL DECISION MAKING WITH LIMITED RESOURCES

dc.contributor.advisor	Srinivasan, Aravind	en_US
dc.contributor.author	Sankararaman, Karthik Abinav	en_US
dc.contributor.department	Computer Science	en_US
dc.contributor.publisher	Digital Repository at the University of Maryland	en_US
dc.contributor.publisher	University of Maryland (College Park, Md.)	en_US
dc.date.accessioned	2019-09-27T05:40:24Z
dc.date.available	2019-09-27T05:40:24Z
dc.date.issued	2019	en_US
dc.description.abstract	One of the goals of Artificial Intelligence (AI) is to enable multiple agents to interact, co-ordinate and compete with each other to realize various goals. Typically, this is achieved via a system which acts as a mediator to control the agents' behavior via incentives. Such systems are ubiquitous and include online systems for shopping (e.g., Amazon), ride-sharing (e.g., Uber, Lyft) and Internet labor markets (e.g., Mechanical Turk). The main algorithmic challenge in such systems is to ensure that they can operate under a variety of informational constraints such as uncertainty in the input, committing to actions based on partial information or being unaffected by noisy input. The mathematical framework used to study such systems are broadly called \emph{sequential decision making} problems where the algorithm does not receive the entire input at once; it obtains parts of the input by interacting (also called "actions") with the environment. In this thesis, we answer the question, under what informational constraints can we design efficient algorithms for sequential decision making problems. The first part of the thesis deals with the Online Matching problem. Here, the algorithm deals with two prominent constraints: uncertainty in the input and choice of actions being restricted by a combinatorial constraint. We design several new algorithms for many variants of this problem and provide provable guarantees. We also show their efficacy on the ride-share application using a real-world dataset. In the second part of the thesis, we consider the Multi-armed bandit problem with additional informational constraints. In this setting, the algorithm does not receive the entire input and needs to make decisions based on partial observations. Additionally, the set of possible actions is controlled by global resource constraints that bind across time. We design new algorithms for multiple variants of this problem that are worst-case optimal. We provide a general reduction framework to the classic multi-armed bandits problem without any constraints. We complement some of the results with preliminary numerical experiments.	en_US
dc.identifier	https://doi.org/10.13016/tnst-gkmo
dc.identifier.uri	http://hdl.handle.net/1903/25039
dc.language.iso	en	en_US
dc.subject.pqcontrolled	Computer science	en_US
dc.subject.pqcontrolled	Operations research	en_US
dc.subject.pqcontrolled	Applied mathematics	en_US
dc.subject.pquncontrolled	Artificial Intelligence	en_US
dc.subject.pquncontrolled	Machine Learning	en_US
dc.subject.pquncontrolled	Multi-armed Bandits	en_US
dc.subject.pquncontrolled	Probability	en_US
dc.subject.pquncontrolled	Randomized Online Algorithms	en_US
dc.subject.pquncontrolled	Sequential Decision Making	en_US
dc.title	SEQUENTIAL DECISION MAKING WITH LIMITED RESOURCES	en_US
dc.type	Dissertation	en_US

Files

Original bundle

Now showing 1 - 2 of 2

Name:: Sankararaman_umd_0117E_20258.pdf
Size:: 2.5 MB
Format:: Adobe Portable Document Format

Download

Name:: Defense_editted.mp4
Size:: 86.66 MB
Format:: Unknown data format

Download

Collections

UMD Theses and Dissertations
Computer Science Theses and Dissertations