Uncovering Strategies and Commitment Through Machine Learning System Introspection

Allen, Julia Filiberti; Schmidt, Steve; Gabriel, Steven Adam

dc.contributor.author	Allen, Julia Filiberti
dc.contributor.author	Schmidt, Steve
dc.contributor.author	Gabriel, Steven Adam
dc.date.accessioned	2024-02-27T08:56:01Z
dc.date.available	2024-02-27T08:56:01Z
dc.date.created	2023-04-28T11:31:35Z
dc.date.issued	2023
dc.identifier.citation	SN Computer Science. 2023, 4 (4), .	en_US
dc.identifier.issn	2662-995X
dc.identifier.uri	https://hdl.handle.net/11250/3120038
dc.description.abstract	Deep neural networks are naturally “black boxes”, offering little insight into how or why they make decisions. These limitations diminish the adoption likelihood of such systems for important tasks and as trusted teammates. We design and employ an introspective method to abstract neural activation patterns into human-interpretable strategies and identify relationships between environmental conditions (why), strategies (how), and performance (result) on a deep reinforcement learning two-dimensional pursuit game application. For example, we found that activation patterns that were abstracted into “head-on” or “L-shaped” maneuver strategies were successful and intuitively corresponded to favorable initial conditions. Moreover, we characterize machine commitment by the introduction of a novel measure based on analysis of time-series neural activation patterns over the course of a game, and reveal significant correlations between machine commitment and performance. By uncovering temporally-dependent machine “thought processes” and commitment through introspection, we contribute to the larger explainable artificial intelligence initiative, increasing transparency and trust in machine learning systems.	en_US
dc.language.iso	eng	en_US
dc.publisher	Springer	en_US
dc.title	Uncovering Strategies and Commitment Through Machine Learning System Introspection	en_US
dc.title.alternative	Uncovering Strategies and Commitment Through Machine Learning System Introspection	en_US
dc.type	Journal article	en_US
dc.type	Peer reviewed	en_US
dc.description.version	publishedVersion	en_US
dc.source.volume	4	en_US
dc.source.journal	SN Computer Science	en_US
dc.source.issue	4	en_US
dc.identifier.doi	10.1007/s42979-023-01747-8
dc.identifier.cristin	2144158
cristin.ispublished	true
cristin.fulltext	original
cristin.qualitycode	1

Associated file(s)

Filename: Allen%2C+Schmidt%2C+Gabriel+-+ ...
Size: 1.576Mb
Format: PDF

Locked

This item appears in the following collection(s)

Institutt for industriell økonomi og teknologiledelse [3034]
Publikasjoner fra CRIStin - NTNU [37304]
