Vis enkel innførsel

dc.contributor.authorFjerdingen, Sigurd Aksnes
dc.contributor.authorKyrkjebø, Erik
dc.contributor.authorTranseth, Aksel Andreas
dc.date.accessioned2018-03-09T07:57:40Z
dc.date.available2018-03-09T07:57:40Z
dc.date.created2012-03-30T11:40:48Z
dc.date.issued2010
dc.identifier.isbn9783800732739
dc.identifier.urihttp://hdl.handle.net/11250/2489643
dc.description.abstractThis paper analyzes the application of several reinforcement learning techniques for continuous state and action spaces to pipeline following for an autonomous underwater vehicle (AUV). Continuous space SARSA is compared to the actor-critic CACLA algorithm, and is also extended into a supervised reinforcement learning architecture. A novel exploration method using the skew-normal stochastic distribution is proposed, and evidence towards advantages in the case of tabula rasa exploration is presented. Results are validated on a realistic simulator of the AUV, and confirm the applicability of reinforcement learning to optimize pipeline following behavior.nb_NO
dc.language.isoengnb_NO
dc.publisherVDE Verlag GmbHnb_NO
dc.relation.ispartofProceedings for the joint conference of ISR 2010, 41st International Symposium on Robotics, ROBOTIK 2010, 6th German Conference on Robotics
dc.titleAUV Pipeline Following using Reinforcement Learningnb_NO
dc.typeChapternb_NO
dc.description.versionacceptedVersionnb_NO
dc.source.pagenumber310-317nb_NO
dc.identifier.cristin918391
dc.description.localcode© 2011 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.nb_NO
cristin.unitcode194,63,25,0
cristin.unitnameInstitutt for teknisk kybernetikk
cristin.ispublishedtrue
cristin.fulltextpostprint
cristin.qualitycode1


Tilhørende fil(er)

Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel