Vis enkel innførsel

dc.contributor.authorTran, Kim-anh
dc.contributor.authorCarlson, Trevor E.
dc.contributor.authorKoukos, Konstantinos
dc.contributor.authorSjälander, Magnus
dc.contributor.authorSpiliopoulos, Vasileios
dc.contributor.authorKaxiras, Stefanos
dc.contributor.authorJimborean, Alexandra
dc.date.accessioned2017-11-08T11:29:37Z
dc.date.available2017-11-08T11:29:37Z
dc.date.created2017-11-06T19:26:56Z
dc.date.issued2017
dc.identifier.issn0018-9340
dc.identifier.urihttp://hdl.handle.net/11250/2464896
dc.description.abstractComplex out-of-order (OoO) processors have been designed to overcome the restrictions of outstanding long-latency misses at the cost of increased energy consumption. Simple, limited OoO processors are a compromise in terms of energy consumption and performance, as they have fewer hardware resources to tolerate the penalties of long-latency loads. In worst case, these loads may stall the processor entirely. We present Clairvoyance, a compiler based technique that generates code able to hide memory latency and better utilize simple OoO processors. By clustering loads found across basic block boundaries, Clairvoyance overlaps the outstanding latencies to increases memory-level parallelism. We show that these simple OoO processors, equipped with the appropriate compiler support, can effectively hide long-latency loads and achieve performance improvements for memory-bound applications. To this end, Clairvoyance tackles (i) statically unknown dependencies, (ii) insufficient independent instructions, and (iii) register pressure. Clairvoyance achieves a geomean execution time improvement of 14% for memory-bound applications, on top of standard O3 optimizations, while maintaining compute-bound applications' high-performance.nb_NO
dc.language.isoengnb_NO
dc.publisherInstitute of Electrical and Electronics Engineers (IEEE)nb_NO
dc.titleStatic Instruction Scheduling for High Performance on Limited Hardwarenb_NO
dc.typeJournal articlenb_NO
dc.typePeer reviewednb_NO
dc.description.versionacceptedVersionnb_NO
dc.source.volumePPnb_NO
dc.source.journalI.E.E.E. transactions on computers (Print)nb_NO
dc.source.issue99nb_NO
dc.identifier.doi10.1109/TC.2017.2769641
dc.identifier.cristin1511573
dc.description.localcode© 2017 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.nb_NO
cristin.unitcode194,63,10,0
cristin.unitnameInstitutt for datateknologi og informatikk
cristin.ispublishedtrue
cristin.fulltextoriginal
cristin.qualitycode2


Tilhørende fil(er)

Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel