Show simple item record

dc.contributor.author: Umuroglu, Yaman
dc.contributor.author: Conficconi, Davide
dc.contributor.author: Rasnayake, Lahiru
dc.contributor.author: Preusser, Thomas B.
dc.contributor.author: Själander, Magnus
dc.date.accessioned: 2020-04-08T09:54:00Z
dc.date.available: 2020-04-08T09:54:00Z
dc.date.created: 2019-11-29T08:28:32Z
dc.date.issued: 2019
dc.identifier.issn: 1936-7406
dc.identifier.uri: https://hdl.handle.net/11250/2650745
dc.description.abstract: Matrix-matrix multiplication is a key computational kernel for numerous applications in science and engineering, with ample parallelism and data locality that lends itself well to high-performance implementations. Many matrix multiplication-dependent applications can use reduced-precision integer or fixed-point representations to increase their performance and energy efficiency while still offering adequate quality of results. However, precision requirements may vary between different application phases or depend on input data, rendering constant-precision solutions ineffective. BISMO, a vectorized bit-serial matrix multiplication overlay for reconfigurable computing, previously utilized the excellent binary-operation performance of FPGAs to offer a matrix multiplication performance that scales with required precision and parallelism. We show how BISMO can be scaled up on Xilinx FPGAs using an arithmetic architecture that better utilizes six-input LUTs. The improved BISMO achieves a peak performance of 15.4 binary TOPS on the Ultra96 board with a Xilinx UltraScale+ MPSoC. [en_US]
dc.language.iso: eng [en_US]
dc.publisher: Association for Computing Machinery (ACM) [en_US]
dc.title: Optimizing Bit-Serial Matrix Multiplication for Reconfigurable Computing [en_US]
dc.type: Peer reviewed [en_US]
dc.type: Journal article [en_US]
dc.description.version: publishedVersion [en_US]
dc.source.volume: 12 [en_US]
dc.source.journal: ACM Transactions on Reconfigurable Technology and Systems [en_US]
dc.source.issue: 3 [en_US]
dc.identifier.doi: 10.1145/3337929
dc.identifier.cristin: 1754205
dc.description.localcode: This article will not be available due to copyright restrictions (c) 2019 by Association for Computing Machinery (ACM) [en_US]
cristin.unitcode: 194,63,10,0
cristin.unitname: Institutt for datateknologi og informatikk
cristin.ispublished: true
cristin.fulltext: original
cristin.qualitycode: 1
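
Note on the abstract: the idea of bit-serial matrix multiplication can be illustrated with a small software sketch. It is not the BISMO hardware design or its API; the function names are hypothetical, and it assumes unsigned integer operands with known bit widths. An integer matrix product is decomposed into binary matrix products over bit planes, each weighted by a power of two, so the number of binary products grows with the product of the operand precisions.

```python
def bit_plane(matrix, bit):
    """Return the 0/1 matrix holding bit number `bit` of every entry."""
    return [[(x >> bit) & 1 for x in row] for row in matrix]


def binary_matmul(a_plane, b_plane):
    """Multiply two 0/1 matrices: each output is a popcount of ANDed bits."""
    inner = len(b_plane)
    cols = len(b_plane[0])
    return [
        [sum(a_row[k] & b_plane[k][j] for k in range(inner)) for j in range(cols)]
        for a_row in a_plane
    ]


def bit_serial_matmul(a, b, a_bits, b_bits):
    """Unsigned integer matmul as a weighted sum of binary matmuls.

    Assumes every entry of `a` fits in `a_bits` bits and every entry
    of `b` fits in `b_bits` bits (illustrative assumption).
    """
    rows, cols = len(a), len(b[0])
    result = [[0] * cols for _ in range(rows)]
    for i in range(a_bits):
        a_plane = bit_plane(a, i)
        for j in range(b_bits):
            partial = binary_matmul(a_plane, bit_plane(b, j))
            # Bit i of A times bit j of B carries weight 2^(i + j).
            for r in range(rows):
                for c in range(cols):
                    result[r][c] += partial[r][c] << (i + j)
    return result


a = [[1, 2], [3, 0]]
b = [[2, 1], [1, 3]]
product = bit_serial_matmul(a, b, a_bits=2, b_bits=2)
```

With 2-bit operands the sketch performs 2 x 2 = 4 binary matrix products; lowering either precision reduces that count proportionally, which is the sense in which the cost scales with the required precision.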

