Show simple item record

dc.contributor.author     Knudsen, Martinius
dc.contributor.author     Hendseth, Sverre
dc.contributor.author     Tufte, Gunnar
dc.contributor.author     Sandvig, Axel
dc.date.accessioned       2022-03-30T11:58:41Z
dc.date.available         2022-03-30T11:58:41Z
dc.date.created           2022-01-14T17:08:31Z
dc.date.issued            2021
dc.identifier.citation    Modeling, Identification and Control. 2021, 42 (4), 197-204.  en_US
dc.identifier.issn        0332-7353
dc.identifier.uri         https://hdl.handle.net/11250/2988592
dc.description.abstract   We present here a model-free method for learning actions that lead to an all-source-all-destination shortest path solution. We motivate our approach in the context of biological learning for reactive control. Our method involves an agent exploring an unknown world with the objective of learning how to get from any starting state to any goal state in shortest time without having to run a path planning algorithm for each new goal selection. Using concepts of Lyapunov functions and Bellman's principle of optimality, our agent learns universal state-goal distances and best actions that solve this problem.  en_US
dc.language.iso           eng  en_US
dc.publisher              Norwegian Society of Automatic Control  en_US
dc.rights                 Navngivelse 4.0 Internasjonal  *
dc.rights.uri             http://creativecommons.org/licenses/by/4.0/deed.no  *
dc.title                  Model-Free All-Source-All-Destination Learning as a Model for Biological Reactive Control  en_US
dc.type                   Journal article  en_US
dc.type                   Peer reviewed  en_US
dc.description.version    publishedVersion  en_US
dc.source.pagenumber      197-204  en_US
dc.source.volume          42  en_US
dc.source.journal         Modeling, Identification and Control  en_US
dc.source.issue           4  en_US
dc.identifier.doi         10.4173/mic.2021.4.5
dc.identifier.cristin     1981523
cristin.ispublished       true
cristin.fulltext          original
cristin.qualitycode       1
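
The abstract above describes an agent that learns universal state-goal distances and best actions via Bellman's principle of optimality. The following is a minimal, hypothetical Python sketch of that general idea (assumed unit step costs, a small deterministic toy world, and invented function names; it is not the authors' implementation):

# Sketch: learn distances D(s, g) and first actions A(s, g) for all
# state-goal pairs from undirected exploration, by relaxing the Bellman
# bound D(s, g) <= 1 + D(s', g) on every observed transition s -a-> s'.
# All names and the toy environment are illustrative assumptions.
import random
from collections import defaultdict

def learn_all_pairs(step, states, actions, episodes=200, horizon=50):
    """step(s, a) -> s' is the unknown deterministic world; unit step cost."""
    INF = float("inf")
    D = defaultdict(lambda: INF)          # D[(s, g)]: estimated steps from s to g
    A = {}                                # A[(s, g)]: first action of best known path
    for s in states:
        D[(s, s)] = 0                     # a state is zero steps from itself
    for _ in range(episodes):
        s = random.choice(states)         # random restart
        for _ in range(horizon):
            a = random.choice(actions)    # undirected exploration
            s_next = step(s, a)
            for g in states:              # relax the bound for every goal at once
                if 1 + D[(s_next, g)] < D[(s, g)]:
                    D[(s, g)] = 1 + D[(s_next, g)]
                    A[(s, g)] = a         # remember the improving action
            s = s_next
    return D, A

# Toy usage: a 5-state chain where actions move one step left or right.
states = list(range(5))
actions = [-1, +1]
step = lambda s, a: min(max(s + a, 0), 4)
D, A = learn_all_pairs(step, states, actions)
print(D[(0, 4)], A.get((0, 4)))           # should converge to: 4 +1

Because every observed transition updates the distance bound for all goals simultaneously, one exploration phase yields shortest-path estimates and first actions for every start-goal pair, without running a planner for each new goal.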


Associated file(s)


This item appears in the following collection(s)


Attribution 4.0 International
Except where otherwise noted, this item's license is described as Attribution 4.0 International