DCMI: A Scalable Strategy for Accelerating Iterative Stencil Loops on FPGAs

Koraei, Mostafa; Fatemi, Omid; Jahre, Magnus

Koraei, Mostafa; Fatemi, Omid; Jahre, Magnus

Journal article, Peer reviewed

Published version

Åpne

Koraei (Låst)

Permanent lenke

http://hdl.handle.net/11250/2626772

Utgivelsesdato

2019

Sammendrag

Iterative Stencil Loops (ISLs) are the key kernel within a range of compute-intensive applications. To accelerate ISLs with Field Programmable Gate Arrays, it is critical to exploit parallelism (1) among elements within the same iteration and (2) across loop iterations. We propose a novel ISL acceleration scheme called Direct Computation of Multiple Iterations (DCMI) that improves upon prior work by pre-computing the effective stencil coefficients after a number of iterations at design time—resulting in accelerators that use minimal on-chip memory and avoid redundant computation. This enables DCMI to improve throughput by up to 7.7× compared to the state-of-the-art cone-based architecture.

Utgiver

Association for Computing Machinery (ACM)

Tidsskrift

ACM Transactions on Architecture and Code Optimization (TACO)