Dopamine modulated STDP and reinforcement learning in an embodied context
Abstract
In recent years artificial neural networks have become increasing popular. New methods and ever increasing computational resources are turning second generation artificial neural networks into powerful tools. Most of the work done with second generation artificial neuron networks do, however, at one point or another involve a phase of supervised learning. Supervised learning methods are inherently limited by the need for labeled training examples. One way of solving this scaling problem is to rely on reinforcement learning, which is a form of unsupervised learning. The more biologically plausible third generation of artificial neural networks have recently been shown capable of tackling the distal reward problem that is at the core of reinforcement learning. Using dopamine modulated spike-timing-dependent plasticity in a spiking neural network, we successfully demonstrate classical conditioning, instrumental conditioning, extinction and second order conditioning in an embodied context.