Vis enkel innførsel

dc.contributor.authorBekkemoen, Yanzhe
dc.date.accessioned2024-06-10T11:27:53Z
dc.date.available2024-06-10T11:27:53Z
dc.date.created2023-11-30T08:30:59Z
dc.date.issued2023
dc.identifier.citationMachine Learning. 2023, 113 355-441.en_US
dc.identifier.issn0885-6125
dc.identifier.urihttps://hdl.handle.net/11250/3133318
dc.description.abstractIn recent years, reinforcement learning (RL) systems have shown impressive performance and remarkable achievements. Many achievements can be attributed to combining RL with deep learning. However, those systems lack explainability, which refers to our understanding of the system’s decision-making process. In response to this challenge, the new explainable RL (XRL) field has emerged and grown rapidly to help us understand RL systems. This systematic literature review aims to give a unified view of the field by reviewing ten existing XRL literature reviews and 189 XRL studies from the past five years. Furthermore, we seek to organize these studies into a new taxonomy, discuss each area in detail, and draw connections between methods and stakeholder questions (e.g., “how can I get the agent to do _?”). Finally, we look at the research trends in XRL, recommend XRL methods, and present some exciting research directions for future research. We hope stakeholders, such as RL researchers and practitioners, will utilize this literature review as a comprehensive resource to overview existing state-of-the-art XRL methods. Additionally, we strive to help find research gaps and quickly identify methods that answer stakeholder questions.en_US
dc.language.isoengen_US
dc.publisherSpringer Natureen_US
dc.rightsNavngivelse 4.0 Internasjonal*
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/deed.no*
dc.titleExplainable reinforcement learning (XRL): a systematic literature review and taxonomyen_US
dc.title.alternativeExplainable reinforcement learning (XRL): a systematic literature review and taxonomyen_US
dc.typeJournal articleen_US
dc.typePeer revieweden_US
dc.description.versionpublishedVersionen_US
dc.source.pagenumber355-441en_US
dc.source.volume113en_US
dc.source.journalMachine Learningen_US
dc.identifier.doi10.1007/s10994-023-06479-7
dc.identifier.cristin2205895
cristin.ispublishedtrue
cristin.fulltextoriginal
cristin.qualitycode2


Tilhørende fil(er)

Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel

Navngivelse 4.0 Internasjonal
Med mindre annet er angitt, så er denne innførselen lisensiert som Navngivelse 4.0 Internasjonal