SEQUENTIAL DECISION MAKING WITH LIMITED RESOURCES

dc.contributor.advisorSrinivasan, Aravinden_US
dc.contributor.authorSankararaman, Karthik Abinaven_US
dc.contributor.departmentComputer Scienceen_US
dc.contributor.publisherDigital Repository at the University of Marylanden_US
dc.contributor.publisherUniversity of Maryland (College Park, Md.)en_US
dc.date.accessioned2019-09-27T05:40:24Z
dc.date.available2019-09-27T05:40:24Z
dc.date.issued2019en_US
dc.description.abstractOne of the goals of Artificial Intelligence (AI) is to enable multiple agents to interact, co-ordinate and compete with each other to realize various goals. Typically, this is achieved via a system which acts as a mediator to control the agents' behavior via incentives. Such systems are ubiquitous and include online systems for shopping (e.g., Amazon), ride-sharing (e.g., Uber, Lyft) and Internet labor markets (e.g., Mechanical Turk). The main algorithmic challenge in such systems is to ensure that they can operate under a variety of informational constraints such as uncertainty in the input, committing to actions based on partial information or being unaffected by noisy input. The mathematical framework used to study such systems are broadly called \emph{sequential decision making} problems where the algorithm does not receive the entire input at once; it obtains parts of the input by interacting (also called "actions") with the environment. In this thesis, we answer the question, under what informational constraints can we design efficient algorithms for sequential decision making problems. The first part of the thesis deals with the Online Matching problem. Here, the algorithm deals with two prominent constraints: uncertainty in the input and choice of actions being restricted by a combinatorial constraint. We design several new algorithms for many variants of this problem and provide provable guarantees. We also show their efficacy on the ride-share application using a real-world dataset. In the second part of the thesis, we consider the Multi-armed bandit problem with additional informational constraints. In this setting, the algorithm does not receive the entire input and needs to make decisions based on partial observations. Additionally, the set of possible actions is controlled by global resource constraints that bind across time. We design new algorithms for multiple variants of this problem that are worst-case optimal. We provide a general reduction framework to the classic multi-armed bandits problem without any constraints. We complement some of the results with preliminary numerical experiments.en_US
dc.identifierhttps://doi.org/10.13016/tnst-gkmo
dc.identifier.urihttp://hdl.handle.net/1903/25039
dc.language.isoenen_US
dc.subject.pqcontrolledComputer scienceen_US
dc.subject.pqcontrolledOperations researchen_US
dc.subject.pqcontrolledApplied mathematicsen_US
dc.subject.pquncontrolledArtificial Intelligenceen_US
dc.subject.pquncontrolledMachine Learningen_US
dc.subject.pquncontrolledMulti-armed Banditsen_US
dc.subject.pquncontrolledProbabilityen_US
dc.subject.pquncontrolledRandomized Online Algorithmsen_US
dc.subject.pquncontrolledSequential Decision Makingen_US
dc.titleSEQUENTIAL DECISION MAKING WITH LIMITED RESOURCESen_US
dc.typeDissertationen_US

Files

Original bundle
Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
Sankararaman_umd_0117E_20258.pdf
Size:
2.5 MB
Format:
Adobe Portable Document Format
No Thumbnail Available
Name:
Defense_editted.mp4
Size:
86.66 MB
Format:
Unknown data format