Dopamine modulated STDP and reinforcement learning in an embodied context

Andersen, Lars; Haus, Tormund Sandve

Andersen, Lars; Haus, Tormund Sandve

Master thesis

Åpne

655640_FULLTEXT01.pdf (2.342Mb)

655640_COVER01.pdf (184.4Kb)

Permanent lenke

http://hdl.handle.net/11250/253403

Utgivelsesdato

2013

Metadata

Vis full innførsel

Samlinger

Institutt for datateknologi og informatikk [6828]

Sammendrag

In recent years artificial neural networks have become increasing popular. New methods and ever increasing computational resources are turning second generation artificial neural networks into powerful tools. Most of the work done with second generation artificial neuron networks do, however, at one point or another involve a phase of supervised learning. Supervised learning methods are inherently limited by the need for labeled training examples. One way of solving this scaling problem is to rely on reinforcement learning, which is a form of unsupervised learning. The more biologically plausible third generation of artificial neural networks have recently been shown capable of tackling the distal reward problem that is at the core of reinforcement learning. Using dopamine modulated spike-timing-dependent plasticity in a spiking neural network, we successfully demonstrate classical conditioning, instrumental conditioning, extinction and second order conditioning in an embodied context.

Utgiver

Institutt for datateknikk og informasjonsvitenskap