Performance and Energy Efficiency of CUDA and OpenCL for GPU Computing using Python
Peer reviewed, Journal article
MetadataShow full item record
Original versionAdvances in Parallel Computing. 2020, 36 593-604. 10.3233/APC200089
In this work, we examine the performance and energy efficiency when using Python for developing HPC codes running on the GPU. We investigate the portability of performance and energy efficiency between CUDA and OpenCL; between GPU generations; and between low-end, mid-range and high-end GPUs. Our findings show that for some combinations of GPU and GPU code, there is a significant speedup for CUDA over OpenCL, but that this does not hold in general. Our experiments show that performance in general varies more between different GPUs, than between using CUDA and OpenCL. Finally, we show that tuning for performance is a good way of tuning for energy efficiency.