Representation Learning for Reinforcement Learning: Modeling Non-Gaussian Transition Probabilities with a Wasserstein Critic
Abstract
Reinforcement learning algorithms depend on effective state representations when solving complex, high-dimensional environments. Recent methods learn state representations using auxiliary objectives that capture behavioral similarity: two states are behaviorally similar if they lead to similar future outcomes under optimal policies. These methods learn explicit probabilistic state transition models and compute distributional distances between the resulting transition probabilities as part of their measure of behavioral similarity.
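As an illustrative form of such a measure (a bisimulation-style metric; the thesis's exact formulation may differ), behavioral similarity between states $s_i$ and $s_j$ combines a reward difference with a distributional distance between their transition probabilities, where $\gamma$ is a discount factor and $W_1$ is the 1-Wasserstein distance:

```latex
d(s_i, s_j) = \left| r_{s_i} - r_{s_j} \right|
            + \gamma \, W_1\!\left( P(\cdot \mid s_i),\; P(\cdot \mid s_j) \right)
```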
This thesis presents a novel extension to several of these methods that directly learns the 1-Wasserstein distance between state transition distributions by exploiting the Kantorovich-Rubinstein duality. This approach eliminates parametric assumptions about the state transition probabilities while providing a smoother estimator of distributional distances. Empirical evaluation demonstrates improved sample efficiency over some of the original methods, at a modest increase in computational cost per sample. The results establish that relaxing theoretical assumptions about state transition modeling leads to more flexible and robust representation learning while maintaining strong performance characteristics.
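A minimal sketch of the core idea follows, assuming a PyTorch setup; the network sizes, hyperparameters, and helper names are placeholders for illustration, not the thesis's actual implementation. The Kantorovich-Rubinstein duality rewrites the distance as $W_1(p, q) = \sup_{\|f\|_L \le 1} \mathbb{E}_{x \sim p}[f(x)] - \mathbb{E}_{y \sim q}[f(y)]$, so a critic $f$ trained to maximize this gap yields a direct estimate of $W_1$ without a parametric transition model. Here the 1-Lipschitz constraint is enforced with a gradient penalty, one common choice; the thesis may enforce it differently.

```python
# Hedged sketch: estimating W1(p, q) with a critic trained on the
# Kantorovich-Rubinstein dual objective. All names and sizes are
# illustrative assumptions, not the thesis's implementation.
import torch
import torch.nn as nn


class Critic(nn.Module):
    """Scalar-valued critic f over (latent) next-state samples."""

    def __init__(self, dim: int, hidden: int = 256):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(dim, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
            nn.Linear(hidden, 1),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.net(x).squeeze(-1)


def gradient_penalty(critic: Critic, x: torch.Tensor, y: torch.Tensor) -> torch.Tensor:
    """Penalize ||grad f|| deviating from 1 on random interpolates of x and y."""
    eps = torch.rand(x.size(0), 1, device=x.device)
    z = (eps * x + (1 - eps) * y).requires_grad_(True)
    grads = torch.autograd.grad(critic(z).sum(), z, create_graph=True)[0]
    return ((grads.norm(2, dim=1) - 1.0) ** 2).mean()


def critic_loss(critic: Critic, x: torch.Tensor, y: torch.Tensor,
                gp_weight: float = 10.0) -> torch.Tensor:
    """Negated KR dual objective (we minimize), plus the Lipschitz penalty."""
    w1_estimate = critic(x).mean() - critic(y).mean()
    return -w1_estimate + gp_weight * gradient_penalty(critic, x, y)


if __name__ == "__main__":
    dim = 8
    critic = Critic(dim)
    opt = torch.optim.Adam(critic.parameters(), lr=1e-4)
    for step in range(200):
        # Stand-ins for samples from two transition distributions
        # P(. | s_i) and P(. | s_j); here, synthetic Gaussians.
        x = torch.randn(64, dim)
        y = torch.randn(64, dim) + 0.5
        loss = critic_loss(critic, x, y)
        opt.zero_grad()
        loss.backward()
        opt.step()
    # After training, critic(x).mean() - critic(y).mean() approximates W1(p, q).
```

In a representation-learning loop, x and y would be sampled next states (or latent next states) drawn from the two transition distributions being compared, and the trained critic's mean gap would supply the $W_1$ term in a behavioral metric like the one sketched above.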