Reinforcement Learning Methods for Conic Finance

Chopra, Sahil

Files

Chopra_umd_0117E_20941.pdf (1.68 MB)

Date

2020

Authors

Chopra, Sahil

Advisor

Madan, Dilip

DRUM DOI

https://doi.org/10.13016/6zp9-sefo

Abstract

Conic Finance is a world of two prices, a more grounded reality than the theory of one price. This world, however, is constructed by considering nonadditive expectations of risks or value functions. This makes some optimization algorithms incompatible with it, if not infeasible. The problem is especially evident in the application of Reinforcement Learning algorithms, where the underlying principles of TD learning and the Bellman equations are based on the additivity of value functions. Hence, the task undertaken here is to adapt recent advances in Distributional Reinforcement Learning so that they support learning in the setting of nonadditive dynamics. Algorithms for discrete and continuous actions are described and illustrated on sample problems in finance.

URI (handle)

http://hdl.handle.net/1903/26471

Collections

UMD Theses and Dissertations
Computer Science Theses and Dissertations
Mathematics Theses and Dissertations