Learning to Control Multi- Dimensional Autonomous Agents using Hebbian Learning

A Global Reward Approach

Master Thesis (2018)
Author(s)

A. Husić (TU Delft - Mechanical Engineering)

Contributor(s)

Martijn Wisse – Mentor

Wouter Wolfslag – Mentor

Faculty
Mechanical Engineering
Copyright
© 2018 Ajdin Husić
More Info
expand_more
Publication Year
2018
Language
English
Copyright
© 2018 Ajdin Husić
Graduation Date
20-12-2018
Awarding Institution
Delft University of Technology
Programme
Mechanical Engineering | Systems and Control
Faculty
Mechanical Engineering
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

The novelty-raahn algorithm has been shown to effectively learn a desired behavior from raw inputs by connecting an autoencoder with a Hebbian network. Hebbian learning is compelling for its biological plausibility and simplicity. It changes the weight of a connection based only on the activations of neurons it connects, and can effectively reinforce good behaviors when combined with neuromodulation. These low-level synaptic weight changes make for a better merge of the three learning tasks of perception, prediction and action. However, the state-ofthe art algorithm requires the design of a highly detailed modulation scheme designed for a specific system, which is disconnected from the overall objective it optimizes. In this thesis, we will propose that similar learning behavior can be achieved, by making the autonomous agent react to longer-term rewards, and thus implicitly introducing prediction capabilities. In doing so, the required modulation scheme becomes connected to the global optimization objective.

Files

Ajdin_Thesis.pdf
(pdf | 1.16 Mb)
License info not available