
Energy Efficiency and Performance Evaluation of Register Level Bitonic Sort: on ARM Mali Powered Exynos 5 Processor

Guise, Matthew
Master thesis
URI
http://hdl.handle.net/11250/253966
Date
2014
Collections
  • Institutt for datateknologi og informatikk [4881]
Abstract
Energy is one of the most important factors affecting the feasibility of reaching exascale computing capabilities. To build supercomputers with this computing power, new hardware needs to be considered in their design. One possibility is to use hardware designed for mobile and embedded systems. In this project, a sorting approach developed for AVX-512 by Xiaochen et al. is implemented using both ARM NEON vectorization and OpenCL. OpenMP is also used. These implementations are profiled on the Arndale development board, which houses dual Cortex-A15 ARM processor cores and an ARM Mali T604 GPU on its Exynos 5 System on Chip. They are compared to other sorting algorithm implementations and measured with regard to performance and energy efficiency. It is found that the NEON vectorization offers a slight increase in performance compared to a merge sort algorithm without such vectorization. The OpenCL implementation has the poorest performance overall. The approach imposes requirements on the input data size which, overall, make it unfavorable on current mobile hardware. The SIMD vector length is deemed an important factor in the performance increase being low. Future hardware with potentially larger SIMD vector lengths could see the method applied with more success.
Publisher
Institutt for datateknikk og informasjonsvitenskap

 

 
