Mixed Precision in Heterogeneous Parallel Computing Platforms via Delayed Code Analysis
Cattaneo, Daniele; Maggioli, Alberto; Magnani, Gabriele; Denisov, Lev; Yang, Shufan; Agosta, Giovanni; Cherubin, Stefano
Journal article
Submitted version
Permanent lenke
https://hdl.handle.net/11250/3131538Utgivelsesdato
2023Metadata
Vis full innførselSamlinger
Originalversjon
10.1007/978-3-031-46077-7_33Sammendrag
Mixed Precision techniques have been successfully applied to improve the performance and energy efficiency of computation in embedded and high performance systems. However, few solutions have been proposed that address precision tuning of both GPGPU code and its corresponding CPU code, limiting the gains achievable by mixed precision. We propose an extension to the TAFFO precision tuning toolset that enables Mixed Precision across the space of floating and fixed point data types on GPGPUs, leveraging static analysis and providing seamless interface adaptation between host and GPGPU kernel code. The proposed tool achieves speedups exceeding 2x by exploiting the optimization of both kernel and host code.