Show simple item record

dc.contributor.authorAbolpour Mofrad, Asieh
dc.contributor.authorYazidi, Anis
dc.contributor.authorAbolpour Mofrad, Samaneh
dc.contributor.authorHammer, Hugo Lewi
dc.contributor.authorArntzen, Erik
dc.date.accessioned2021-02-25T07:50:58Z
dc.date.available2021-02-25T07:50:58Z
dc.date.created2021-02-03T14:59:44Z
dc.date.issued2021
dc.identifier.issn0899-7667
dc.identifier.urihttps://hdl.handle.net/11250/2730237
dc.description.abstractFormation of stimulus equivalence classes has been recently modeled through equivalence projective simulation (EPS), a modified version of a projective simulation (PS) learning agent. PS is endowed with an episodic memory that resembles the internal representation in the brain and the concept of cognitive maps. PS flexibility and interpretability enable the EPS model and, consequently the model we explore in this letter, to simulate a broad range of behaviors in matching-to-sample experiments. The episodic memory, the basis for agent decision making, is formed during the training phase. Derived relations in the EPS model that are not trained directly but can be established via the network's connections are computed on demand during the test phase trials by likelihood reasoning. In this letter, we investigate the formation of derived relations in the EPS model using network enhancement (NE), an iterative diffusion process, that yields an offline approach to the agent decision making at the testing phase. The NE process is applied after the training phase to denoise the memory network so that derived relations are formed in the memory network and retrieved during the testing phase. During the NE phase, indirect relations are enhanced, and the structure of episodic memory changes. This approach can also be interpreted as the agent's replay after the training phase, which is in line with recent findings in behavioral and neuroscience studies. In comparison with EPS, our model is able to model the formation of derived relations and other features such as the nodal effect in a more intrinsic manner. Decision making in the test phase is not an ad hoc computational method, but rather a retrieval and update process of the cached relations from the memory network based on the test trial. In order to study the role of parameters on agent performance, the proposed model is simulated and the results discussed through various experimental settings.en_US
dc.language.isoengen_US
dc.publisherMIT Pressen_US
dc.titleEnhanced Equivalence Projective Simulation: A Framework for Modeling Formation of Stimulus Equivalence Classesen_US
dc.typePeer revieweden_US
dc.typeJournal articleen_US
dc.description.versionpublishedVersionen_US
dc.source.journalNeural Computationen_US
dc.identifier.doi10.1162/neco_a_01346
dc.identifier.cristin1886392
dc.description.localcodeLocked until 2.5.2021 due to copyright restrictions. © 2020 Massachusetts Institute of Technologyen_US
cristin.ispublishedtrue
cristin.fulltextpostprint
cristin.qualitycode2


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record