dc.contributor.author: Allen, Julia Filiberti
dc.contributor.author: Schmidt, Steve
dc.contributor.author: Gabriel, Steven Adam
dc.date.accessioned: 2024-02-27T08:56:01Z
dc.date.available: 2024-02-27T08:56:01Z
dc.date.created: 2023-04-28T11:31:35Z
dc.date.issued: 2023
dc.identifier.citation: SN Computer Science. 2023, 4 (4), . (en_US)
dc.identifier.issn: 2662-995X
dc.identifier.uri: https://hdl.handle.net/11250/3120038
dc.description.abstract: Deep neural networks are naturally “black boxes”, offering little insight into how or why they make decisions. These limitations diminish the adoption likelihood of such systems for important tasks and as trusted teammates. We design and employ an introspective method to abstract neural activation patterns into human-interpretable strategies and identify relationships between environmental conditions (why), strategies (how), and performance (result) on a deep reinforcement learning two-dimensional pursuit game application. For example, we found that activation patterns that were abstracted into “head-on” or “L-shaped” maneuver strategies were successful and intuitively corresponded to favorable initial conditions. Moreover, we characterize machine commitment by the introduction of a novel measure based on analysis of time-series neural activation patterns over the course of a game, and reveal significant correlations between machine commitment and performance. By uncovering temporally-dependent machine “thought processes” and commitment through introspection, we contribute to the larger explainable artificial intelligence initiative, increasing transparency and trust in machine learning systems. (en_US)
dc.language.iso: eng (en_US)
dc.publisher: Springer (en_US)
dc.title: Uncovering Strategies and Commitment Through Machine Learning System Introspection (en_US)
dc.title.alternative: Uncovering Strategies and Commitment Through Machine Learning System Introspection (en_US)
dc.type: Journal article (en_US)
dc.type: Peer reviewed (en_US)
dc.description.version: publishedVersion (en_US)
dc.source.volume: 4 (en_US)
dc.source.journal: SN Computer Science (en_US)
dc.source.issue: 4 (en_US)
dc.identifier.doi: 10.1007/s42979-023-01747-8
dc.identifier.cristin: 2144158
cristin.ispublished: true
cristin.fulltext: original
cristin.qualitycode: 1