Vis enkel innførsel

dc.contributor.advisorElster, Anne Cathrine
dc.contributor.advisorMeyer, Jan Christian
dc.contributor.authorRogstadkjærnet, Even Olsson
dc.date.accessioned2019-09-11T10:56:25Z
dc.date.available2019-09-11T10:56:25Z
dc.date.created2018-08-01
dc.date.issued2018
dc.identifierntnudaim:20171
dc.identifier.urihttp://hdl.handle.net/11250/2615870
dc.description.abstractAs the adoption of parallel and heterogeneous systems increases, programming such systems also becomes increasingly complex. Frameworks like CUDA and OpenCL provides functional portability across their supported devices. However, having the same code run optimally across multiple devices with different architectures, including being able to port code fairly seamlessly and efficiently to other GPU device architectures, is not provided. This challenge, known as performance portability, is significant since GPU architectures tend to get updated and vary even more than CPU architectures. By transforming optimizations into tuning parameters that can be applied statically by the compiler, an auto-tuner can be used to pick the best combination of optimizations for each architecture. This strategy has earlier been explored using the ImageCL language and compiler, which moves much of the complexity away from the programmer by abstracting away many optimizations which would normally have to be applied manually. In this thesis, we extend the ImageCL language and compiler to support a broader range of applications. These extensions will be guided by miniAMR, a proxy application with the performance characteristics of an Adaptive Mesh Refinement (AMR) application. AMR is a computational method used for adapting the accuracy within certain regions of a domain, and is often used in scientific and engineering applications. We generate multiple GPU stencil kernels from ImageCL code and integrate them into the miniAMR application. We are able to show a considerable speedup (up to 6.78x) for many of the generated stencil kernels in miniAMR compared to the reference implementation.en
dc.languageeng
dc.publisherNTNU
dc.subjectDatateknologi (2 årig), Algoritmer og HPCen
dc.titleImageCL 3D Extensions Targeting Adaptive Mesh Refinement Proxy Applications on GPUsen
dc.typeMaster thesisen
dc.source.pagenumber102
dc.contributor.departmentNorges teknisk-naturvitenskapelige universitet, Fakultet for informasjonsteknologi og elektroteknikk,Institutt for datateknologi og informatikknb_NO


Tilhørende fil(er)

Thumbnail
Thumbnail
Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel