    • Characterizing Multi-Chip GPU Data Sharing

      Zhang, Shiqing; Naderan-Tahan, Mahmood; Jahre, Magnus; Eeckhout, Lieven (Journal article; Peer reviewed, 2023)
      Multi-chip Graphics Processing Unit (GPU) systems are critical to scale performance beyond a single GPU chip for a wide variety of important emerging applications. A key challenge for multi-chip GPUs, though, is how to ...
    • DCMI: A Scalable Strategy for Accelerating Iterative Stencil Loops on FPGAs 

      Koraei, Mostafa; Fatemi, Omid; Jahre, Magnus (Journal article; Peer reviewed, 2019)
      Iterative Stencil Loops (ISLs) are the key kernel within a range of compute-intensive applications. To accelerate ISLs with Field Programmable Gate Arrays, it is critical to exploit parallelism (1) among elements within ...
    • Delay-on-Squash: Stopping Microarchitectural Replay Attacks in Their Tracks 

      Sakalis, Christos; Kaxiras, Stefanos; Själander, Hans Magnus (Peer reviewed; Journal article, 2022)
      MicroScope and other similar microarchitectural replay attacks take advantage of the characteristics of speculative execution to trap the execution of the victim application in a loop, enabling the attacker to amplify a ...
    • Dependence-aware Slice Execution to Boost MLP in Slice-out-of-order Cores 

      Kumar, Rakesh; Alipour, Mehdi; Black-Schaffer, David (Peer reviewed; Journal article, 2022)
    • Solving Sparse Assignment Problems on FPGAs 

      Jellum, Erling Rennemo; Orlandic, Milica; Brekke, Edmund Førland; Johansen, Tor Arne; Bryne, Torleiv Håland (Peer reviewed; Journal article, 2022)
      The assignment problem is a fundamental optimization problem and a crucial part of many systems. For example, in multiple object tracking, the assignment problem is used to associate object detections with hypothetical ...
    • WaFFLe: Gated Cache-Ways with Per-Core Fine-Grained DVFS for Reduced On-Chip Temperature and Leakage Consumption 

      Chakraborty, Shounak; Själander, Magnus (Peer reviewed; Journal article, 2021)
      Managing thermal imbalance in contemporary chip multi-processors (CMPs) is crucial in assuring functional correctness of modern mobile as well as server systems. Localized regions with high activity, e.g., register files, ...