Abstract
In model-based Reinforcement Learning, an agent aims to learn a transition model between attainable states. Since the agent initially has zero knowledge of the transition model, it needs to resort to random exploration in order to learn the model. In this work, we demonstrate how the Ornstein-Uhlenbeck process can be used as a sampling scheme to generate exploratory Brownian motion in the absence of a transition model. Whereas current approaches rely on knowledge of the transition model to generate the steps of Brownian motion, the Ornstein-Uhlenbeck process does not. Additionally, the Ornstein-Uhlenbeck process naturally includes a drift term originating from a potential function. We show that this potential can be controlled by the agent itself, and allows executing non-equilibrium behavior such as ballistic motion or local trapping.
Original language | English |
---|---|
Title of host publication | COMPLEXIS 2019 - Proceedings of the 4th International Conference on Complexity, Future Information Systems and Risk |
Editors | Victor Mendez Munoz, Farshad Firouzi, Ernesto Estrada, Victor Chang |
Publisher | SciTePress |
Pages | 59-66 |
Number of pages | 8 |
ISBN (Electronic) | 9789897583667 |
DOIs | |
Publication status | Published - 2019 |
Externally published | Yes |
Event | 4th International Conference on Complexity, Future Information Systems and Risk, COMPLEXIS 2019 - Heraklion, Crete, Greece Duration: 2 May 2019 → 4 May 2019 |
Conference
Conference | 4th International Conference on Complexity, Future Information Systems and Risk, COMPLEXIS 2019 |
---|---|
Country/Territory | Greece |
City | Heraklion, Crete |
Period | 2/05/19 → 4/05/19 |
Keywords
- Brownian Motion
- Exploration
- Ornstein-Uhlenbeck Process