Vis enkel innførsel

dc.contributor.authorJohnsen, Pål Vegard
dc.contributor.authorStrumke, Inga
dc.contributor.authorLangaas, Mette
dc.contributor.authorDeWan, Andrew Thomas
dc.contributor.authorRiemer-Sørensen, Signe
dc.date.accessioned2023-11-01T06:57:18Z
dc.date.available2023-11-01T06:57:18Z
dc.date.created2023-04-11T13:36:57Z
dc.date.issued2023
dc.identifier.issn1553-734X
dc.identifier.urihttps://hdl.handle.net/11250/3099866
dc.description.abstractEstimating feature importance, which is the contribution of a prediction or several predictions due to a feature, is an essential aspect of explaining data-based models. Besides explaining the model itself, an equally relevant question is which features are important in the underlying data generating process. We present a Shapley-value-based framework for inferring the importance of individual features, including uncertainty in the estimator. We build upon the recently published model-agnostic feature importance score of SAGE (Shapley additive global importance) and introduce Sub-SAGE. For tree-based models, it has the advantage that it can be estimated without computationally expensive resampling. We argue that for all model types the uncertainties in our Sub-SAGE estimator can be estimated using bootstrapping and demonstrate the approach for tree ensemble methods. The framework is exemplified on synthetic data as well as large genotype data for predicting feature importance with respect to obesity.en_US
dc.language.isoengen_US
dc.publisherPublic Library of Science, PLOSen_US
dc.rightsNavngivelse 4.0 Internasjonal*
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/deed.no*
dc.titleInferring feature importance with uncertainties with application to large genotype dataen_US
dc.title.alternativeInferring feature importance with uncertainties with application to large genotype dataen_US
dc.typePeer revieweden_US
dc.typeJournal articleen_US
dc.description.versionpublishedVersionen_US
dc.source.volume19en_US
dc.source.journalPLoS Computational Biologyen_US
dc.source.issue3en_US
dc.identifier.doi10.1371/journal.pcbi.1010963
dc.identifier.cristin2139997
cristin.ispublishedtrue
cristin.fulltextoriginal
cristin.qualitycode2


Tilhørende fil(er)

Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel

Navngivelse 4.0 Internasjonal
Med mindre annet er angitt, så er denne innførselen lisensiert som Navngivelse 4.0 Internasjonal