Vis enkel innførsel

dc.contributor.authorZhang, Shiqing
dc.contributor.authorNaderan-Tahan, Mahmood
dc.contributor.authorJahre, Magnus
dc.contributor.authorEeckhout, Lieven
dc.date.accessioned2023-08-16T09:11:43Z
dc.date.available2023-08-16T09:11:43Z
dc.date.created2023-06-30T09:34:15Z
dc.date.issued2023
dc.identifier.isbn979-8-4007-0095-8
dc.identifier.urihttps://hdl.handle.net/11250/3084351
dc.description.abstractBandwidth non-uniformity in multi-chip GPUs poses a major design challenge for its last-level cache (LLC) architecture. Whereas a memory-side LLC caches data from the local memory partition while being accessible by all chips, an SM-side LLC is private to a chip while caching data from all memory partitions. We find that some workloads prefer a memory-side LLC while others prefer an SM-side LLC, and this preference solely depends on which organization maximizes the effective LLC bandwidth. In contrast to prior work which optimizes bandwidth beyond the LLC, we make the observation that the effective bandwidth ahead of the LLC is critical to end-to-end application performance. We propose Sharing-Aware Caching (SAC) to adopt either a memory-side or SM-side LLC organization by dynamically reconfiguring the routing policies in the intra-chip interconnection network and LLC controllers. SAC is driven by a simple and lightweight analytical model that predicts the impact of data sharing across chips on the effective LLC bandwidth. SAC improves average performance by 76% and 12% (and up to 157% and 49%) compared to a memory-side and SM-side LLC, respectively. We demonstrate significant performance improvements across the design space and across workloads.en_US
dc.language.isoengen_US
dc.publisherACMen_US
dc.relation.ispartofProceedings of the 50th Annual International Symposium on Computer Architecture (ISCA'23)
dc.rightsNavngivelse 4.0 Internasjonal*
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/deed.no*
dc.titleSAC: Sharing-Aware Caching in Multi-Chip GPUsen_US
dc.title.alternativeSAC: Sharing-Aware Caching in Multi-Chip GPUsen_US
dc.typeChapteren_US
dc.description.versionacceptedVersionen_US
dc.source.pagenumber605-617en_US
dc.identifier.doi10.1145/3579371.3589078
dc.identifier.cristin2159691
dc.relation.projectNorges forskningsråd: 286596en_US
cristin.ispublishedtrue
cristin.fulltextpostprint
cristin.qualitycode1


Tilhørende fil(er)

Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel

Navngivelse 4.0 Internasjonal
Med mindre annet er angitt, så er denne innførselen lisensiert som Navngivelse 4.0 Internasjonal