dc.contributor.advisor | Downing, Keith | nb_NO |
dc.contributor.author | Sjonfjell, Vegard Aksland | nb_NO |
dc.date.accessioned | 2014-12-19T13:41:29Z | |
dc.date.available | 2014-12-19T13:41:29Z | |
dc.date.created | 2014-09-30 | nb_NO |
dc.date.issued | 2014 | nb_NO |
dc.identifier | 751082 | nb_NO |
dc.identifier | ntnudaim:8797 | nb_NO |
dc.identifier.uri | http://hdl.handle.net/11250/253755 | |
dc.description.abstract | This project explores the ability of Recurrent Neural Networks (RNNs) to memorize previous input states in time series problems. A type of RNN called Long Short Term Memory (LSTM), which is designed specifically to handle long-term dependencies in input data, is compared against recurrent multilayer perceptrons (recurrent MLPs) on two non-trivial time series problems that require a varying number of previous events to be remembered in order to predict the next state. The first problem is artificial grammar learning: learning a randomly generated grammar solely from sample strings produced by a nondeterministic symbol-producing automaton. The second problem is to train an agent, by reinforcement learning, to play a modified version of the computer game Flappy Bird. The results show that LSTM outclasses standard recurrent MLPs in the artificial grammar learning task, since it can remember past states of the time series stream several timesteps after they have occurred, without any degradation. The LSTM-based agent also scores substantially higher in the Flappy Bird game than both a feedforward and a recurrent MLP-based agent, possibly because it is more resistant to variations in the input. | nb_NO |
dc.language | eng | nb_NO |
dc.publisher | Institutt for datateknikk og informasjonsvitenskap | nb_NO |
dc.title | Remembering Past States using Long Short Term Memory Neural Networks | nb_NO |
dc.type | Master thesis | nb_NO |
dc.source.pagenumber | 73 | nb_NO |
dc.contributor.department | Norges teknisk-naturvitenskapelige universitet, Fakultet for informasjonsteknologi, matematikk og elektroteknikk, Institutt for datateknikk og informasjonsvitenskap | nb_NO |