Show simple item record

dc.contributor.advisor: Natvig, Lasse (nb_NO)
dc.contributor.advisor: Djupdal, Asbjørn (nb_NO)
dc.contributor.advisor: Cebrian, Juan Manuel (nb_NO)
dc.contributor.author: Guise, Matthew (nb_NO)
dc.date.accessioned: 2014-12-19T13:42:11Z
dc.date.available: 2014-12-19T13:42:11Z
dc.date.created: 2014-12-07 (nb_NO)
dc.date.issued: 2014 (nb_NO)
dc.identifier: 769288 (nb_NO)
dc.identifier: ntnudaim:11685 (nb_NO)
dc.identifier.uri: http://hdl.handle.net/11250/253966
dc.description.abstract: Energy is one of the most important factors affecting the feasibility of reaching exascale computing capabilities. To build supercomputers with this computing power, new hardware needs to be considered in their design. One possibility is to use hardware designed for mobile and embedded systems. In this project, a sorting approach developed for AVX-512 by Xiaochen et al. is implemented using both ARM NEON vectorization and OpenCL. OpenMP is also used. These implementations are profiled on the Arndale development board, which houses dual Cortex-A15 ARM processor cores and an ARM Mali T604 GPU on its Exynos 5 System on Chip. They are compared to other sorting algorithm implementations and measured with regard to performance and energy efficiency. It is found that the NEON vectorization offers a slight increase in performance compared to a merge sort algorithm without such vectorization. The OpenCL implementation has the poorest performance overall. The approach imposes requirements on the input data size which, overall, make it unfavorable on current mobile hardware. The SIMD vector length is deemed an important reason why the performance increase is low. Future hardware with potentially larger SIMD vector lengths could see the method applied with more success. (nb_NO)
dc.language: eng (nb_NO)
dc.publisher: Institutt for datateknikk og informasjonsvitenskap (nb_NO)
dc.subject: ntnudaim:11685 (no_NO)
dc.subject: MTDT Datateknologi (no_NO)
dc.subject: Komplekse datasystemer (no_NO)
dc.title: Energy Efficiency and Performance Evaluation of Register Level Bitonic Sort: on ARM Mali Powered Exynos 5 Processor (nb_NO)
dc.type: Master thesis (nb_NO)
dc.source.pagenumber: 69 (nb_NO)
dc.contributor.department: Norges teknisk-naturvitenskapelige universitet, Fakultet for informasjonsteknologi, matematikk og elektroteknikk, Institutt for datateknikk og informasjonsvitenskap (nb_NO)

