Transition Space Distance Learning

Date

2019

Repository Usage Stats

187 views, 165 downloads

Abstract

The notion of distance plays an important role in many reinforcement learning (RL) techniques. This role may be explicit, as in some non-parametric approaches, or it may be implicit in the architecture of the feature space. The ability to learn distance functions tailored for RL tasks could thus benefit many different RL paradigms. While several approaches to learning distance functions from data do exist, they are frequently intended for use in clustering or classification tasks and typically do not take into account the inherent structure present in trajectories sampled from RL environments. For those that do, this structure is generally used to define a similarity between states rather than to represent the mechanics of the domain. Based on the idea that a good distance function in such a domain would reflect the number of transitions necessary to get from one state to another, we detail an approach to learning distance functions which accounts for the nature of state transitions in a Markov decision process, including their inherent directionality. We then present the results of experiments performed in multiple RL environments to demonstrate the benefit of learning such distance functions.
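The abstract states the core idea without specifying a procedure: learn a distance d(s, s') that approximates the number of transitions needed to move from s to s', which is inherently asymmetric since transitions in an MDP are directional. As a rough illustration only, not the thesis's actual method, the minimal sketch below (all class and function names are hypothetical) uses two separate encoders for source and target states so that d(s, s') need not equal d(s', s), and regresses observed one-step transitions toward distance 1.

```python
# Minimal sketch of an asymmetric transition-based distance (hypothetical;
# not the method from the thesis). Observed one-step transitions (s -> s')
# are trained toward distance 1, so d roughly tracks transition counts.
import torch
import torch.nn as nn


class TransitionDistance(nn.Module):
    """Asymmetric distance: separate "source" and "target" encoders
    mean d(s, s') != d(s', s), reflecting transition directionality."""

    def __init__(self, state_dim, embed_dim=32):
        super().__init__()
        self.src = nn.Sequential(
            nn.Linear(state_dim, 64), nn.ReLU(), nn.Linear(64, embed_dim))
        self.dst = nn.Sequential(
            nn.Linear(state_dim, 64), nn.ReLU(), nn.Linear(64, embed_dim))

    def forward(self, s, s_next):
        # Euclidean distance between the source embedding of s and the
        # target embedding of s'.
        return torch.norm(self.src(s) - self.dst(s_next), dim=-1)


def train_step(model, opt, s, s_next):
    """One gradient step on a batch of observed one-step transitions.
    A complete method would also constrain non-adjacent state pairs
    (e.g., via negative sampling); omitted here for brevity."""
    opt.zero_grad()
    d = model(s, s_next)
    loss = ((d - 1.0) ** 2).mean()  # one observed transition ~ distance 1
    loss.backward()
    opt.step()
    return loss.item()


if __name__ == "__main__":
    # Random placeholder data standing in for trajectories sampled
    # from an RL environment.
    torch.manual_seed(0)
    model = TransitionDistance(state_dim=4)
    opt = torch.optim.Adam(model.parameters(), lr=1e-3)
    s = torch.randn(256, 4)
    s_next = s + 0.1 * torch.randn(256, 4)  # fake one-step transitions
    for _ in range(200):
        loss = train_step(model, opt, s, s_next)
    print(f"final loss: {loss:.4f}")
```

Using two encoders rather than one shared embedding is one simple way to obtain the directionality the abstract emphasizes; the thesis itself may realize this property differently.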

Citation

Nemecek, Mark William (2019). Transition Space Distance Learning. Master's thesis, Duke University. Retrieved from https://hdl.handle.net/10161/18849.

Collections


Duke's student scholarship is made available to the public using a Creative Commons Attribution-NonCommercial-NoDerivatives (CC BY-NC-ND) license.