Vis enkel innførsel

dc.contributor.authorMeyer, Eivind
dc.contributor.authorRobinson, Haakon
dc.contributor.authorRasheed, Adil
dc.contributor.authorSan, Omer
dc.date.accessioned2021-04-20T13:26:50Z
dc.date.available2021-04-20T13:26:50Z
dc.date.created2020-02-16T23:51:47Z
dc.date.issued2020
dc.identifier.citationIEEE Access. 2020, 8 41466-41481.en_US
dc.identifier.issn2169-3536
dc.identifier.urihttps://hdl.handle.net/11250/2738708
dc.description.abstractIn this article, we explore the feasibility of applying proximal policy optimization, a state-of-the-art deep reinforcement learning algorithm for continuous control tasks, on the dual-objective problem of controlling an underactuated autonomous surface vehicle to follow an a priori known path while avoiding collisions with non-moving obstacles along the way. The AI agent, which is equipped with multiple rangefinder sensors for obstacle detection, is trained and evaluated in a challenging, stochastically generated simulation environment based on the OpenAI gym Python toolkit. Notably, the agent is provided with real-time insight into its own reward function, allowing it to dynamically adapt its guidance strategy. Depending on its strategy, which ranges from radical path-adherence to radical obstacle avoidance, the trained agent achieves an episodic success rate close to 100%.en_US
dc.language.isoengen_US
dc.publisherIEEEen_US
dc.rightsNavngivelse 4.0 Internasjonal*
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/deed.no*
dc.titleTaming an autonomous surface vehicle for path following and collision avoidance using deep reinforcement learningen_US
dc.typePeer revieweden_US
dc.typeJournal articleen_US
dc.description.versionpublishedVersionen_US
dc.source.pagenumber41466-41481en_US
dc.source.volume8en_US
dc.source.journalIEEE Accessen_US
dc.identifier.doi10.1109/ACCESS.2020.2976586
dc.identifier.cristin1794567
dc.relation.projectNorges forskningsråd: 295033en_US
dc.description.localcodeOpen access. Published by IEEE.en_US
cristin.ispublishedtrue
cristin.fulltextpostprint
cristin.fulltextoriginal
cristin.qualitycode1


Tilhørende fil(er)

Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel

Navngivelse 4.0 Internasjonal
Med mindre annet er angitt, så er denne innførselen lisensiert som Navngivelse 4.0 Internasjonal