Heuristics-based compartmentalization of Replay memory in simple environments

Wasaznik, Aleksander Gustaw

dc.contributor.advisor	Ruocco, Massimiliano
dc.contributor.author	Wasaznik, Aleksander Gustaw
dc.date.accessioned	2020-02-29T15:00:09Z
dc.date.available	2020-02-29T15:00:09Z
dc.date.issued	2019
dc.identifier.uri	http://hdl.handle.net/11250/2644496
dc.description.abstract	En viktig komponent av moderne forsterkningslæringsalgoritmer er repriseminnet. En rekke foreslåtte endringer i virkemåten til repriseminnet har blitt utforsket, men de fleste har med samplingsmekanismen å gjøre. Denne rapporten utforsker muligheten for å utvide en annen side ved repriseminnealgoritmen: det å avgjøre intelligent hvilken erfaring som skal erstattes når en ny erfaring legges til i et fullt minne. Metoden som utforskes er å dele repriseminnet i to og bruke en heuristikk til å fordele erfaringen mellom de to delene.
dc.description.abstract	A ubiquitous component of state of the art reinforcement learning algorithms is the replay memory. Numerous proposed alterations to the operation of the replay memory have been explored, but they deal with the sampling mechanism. This report explores the possibility of augmenting another faucet of the replay memory algorithm: intelligently deciding on which experience to evict when adding new experience to a full memory. The explored method is to compartmentalize the replay memory into two buffers and direct experience to either based on a heuristic.
dc.language	eng
dc.publisher	NTNU
dc.title	Heuristics-based compartmentalization of Replay memory in simple environments
dc.type	Master thesis

Tilhørende fil(er)

Filer	Størrelse	Format	Vis

Denne innførselen finnes i følgende samling(er)

Institutt for datateknologi og informatikk [6544]

Vis enkel innførsel