Heterogeneous FTDT for Seismic Processing

Skomedal, Andreas Berg

dc.contributor.advisor	Elster, Anne Cathrine	nb_NO
dc.contributor.author	Skomedal, Andreas Berg	nb_NO
dc.date.accessioned	2014-12-19T13:40:12Z
dc.date.available	2014-12-19T13:40:12Z
dc.date.created	2013-10-12	nb_NO
dc.date.issued	2013	nb_NO
dc.identifier	655628	nb_NO
dc.identifier	ntnudaim:9486	nb_NO
dc.identifier.uri	http://hdl.handle.net/11250/253360
dc.description.abstract	In the early days of computing, scientific calculations were done by specializedhardware. More recently, increasingly powerful CPUs took over and have beendominant for a long time. Now though, scientific computation is not only forthe general CPU environment anymore. GPUs are specialized processors withtheir own memory hierarchy requiring more effort to program, but for suitablealgorithms they may significantly outperform serially optimized CPUs. In recentyears, these GPUs have become a lot more easily programmable, where they in thepast had to be programmed through the abstraction of a graphics pipeline.EMGS in Trondheim is an oil-finding service working with analysis of seismicreadings of the ocean floor, to provide information about possible oil reservoirs.Data-centers comprised of CPU nodes does all the work today, however GPUinstallations could be more cost effective and faster.In this thesis we look at the implementation of the main part of one of theirdata analysis algorithms. For this we use the FDTD method implemented inYee bench by Ulf Andersson. We look at how to adapt it for GPU using CUDA,parallelize the CPU implementations and how to run this efficiently togetherheterogeneously.It is shown that this method has great potential for use on GPUs, speedupsjust short of 19x over single thread CPU are achieved in this work. The FDTDmethod we use does however have some erratic memory operations which limitsour performance compared to great GPU implementations these days which canreach speedups of over 100x. However, many of them still compare to singleCPU performance. The order in which we address memory is therefore evenmore important, we show that optimizing memory writes when half the memoryreads will not coalesce still improves our performance considerably. We show thatcare is needed when scheduling jobs on both CPU and GPU on the same node toavoid the total performance going down. Using all available resources on the hostmay not be beneficial. Utilizing several parallel CUDA streams proves effective tohide a lot of overhead and delay caused by busy CPU and main memory.This work is not a final solution for EMGS? needs for this tool, other consid-erations and options than those discussed are also of interest. These topics areincluded in the future work section.	nb_NO
dc.language	eng	nb_NO
dc.publisher	Institutt for datateknikk og informasjonsvitenskap	nb_NO
dc.title	Heterogeneous FTDT for Seismic Processing	nb_NO
dc.type	Master thesis	nb_NO
dc.source.pagenumber	95	nb_NO
dc.contributor.department	Norges teknisk-naturvitenskapelige universitet, Fakultet for informasjonsteknologi, matematikk og elektroteknikk, Institutt for datateknikk og informasjonsvitenskap	nb_NO

Tilhørende fil(er)

Filnavn:: 655628_FULLTEXT01.pdf
Størrelse:: 1.058Mb
Format:: PDF

Åpne

Filnavn:: 655628_COVER01.pdf
Størrelse:: 184.1Kb
Format:: PDF

Åpne

Filnavn:: 655628_ATTACHMENT01.zip
Størrelse:: 20.97Kb
Format:: Ukjent

Åpne

Denne innførselen finnes i følgende samling(er)

Institutt for datateknologi og informatikk [6771]

Vis enkel innførsel