Show simple item record

dc.contributor.authorAaker, Ole Edvard
dc.contributor.authorRaknes, Espen Birger
dc.contributor.authorArntsen, Børge
dc.date.accessioned2021-02-12T15:44:33Z
dc.date.available2021-02-12T15:44:33Z
dc.date.created2020-07-02T13:41:29Z
dc.date.issued2020
dc.identifier.citationJournal of Supercomputing. 2020, .en_US
dc.identifier.issn0920-8542
dc.identifier.urihttps://hdl.handle.net/11250/2727867
dc.description.abstractFull waveform inversion (FWI) is a procedure used to determine the elastic parameters of the Earth by reducing the misfit between observed elastodynamic wavefields and their numerically modelled counterparts. The numerical solution of the elastodynamic wave equation is computationally expensive, and its performance is typically bandwidth bound. Computing the gradient of the FWI misfit functional adds further complexity as it involves computing the zero-lag cross-correlation of two wavefields propagating in opposite temporal directions. In this paper, we utilize graphics processing units (GPUs) for their high memory bandwidth and combine two principal optimizations in order to compute FWI gradients on large models and for long simulation times. Wavefield reconstruction methods allow efficient gradient computations with minimal memory requirements and interconnection transfers. Time-space tiling techniques permit us to transcend the limited amount of GPU memory while avoiding dramatic slowdowns due to the low interconnection bandwidth. The implementation considers a task-oriented, hybrid usage of explicitly managed and Unified Memory in order to satisfy the requirements. Benchmarks demonstrate that the proposed approach is able to preserve 78–90% of the original performance, when oversubscribing the amount of physical memory available on GPUs. Comparison with existing methods highlights the benefits of the method.en_US
dc.language.isoengen_US
dc.publisherSpringeren_US
dc.titleElastodynamic full waveform inversion on GPUs with time-space tiling and wavefield reconstructionen_US
dc.typePeer revieweden_US
dc.typeJournal articleen_US
dc.description.versionacceptedVersionen_US
dc.source.pagenumber42en_US
dc.source.journalJournal of Supercomputingen_US
dc.identifier.doi10.1007/s11227-020-03352-5
dc.identifier.cristin1818295
dc.relation.projectNorges forskningsråd: 291192en_US
dc.description.localcodeThis is a post-peer-review, pre-copyedit version of an article. Locked until 17/6-2021 due to copyright restrictions. The final authenticated version is available online at:http://dx.doi.org/10.1007/s11227-020-03352-5en_US
cristin.ispublishedtrue
cristin.fulltextpostprint
cristin.qualitycode1


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record