Performance Analysis of Cache-Aware Multicore Parallelization with Application to Optimization Theory

Stensen, Kristoffer

Stensen, Kristoffer

Master thesis

Åpne

566384_COVER01.pdf (184.2Kb)

566384_FULLTEXT01.pdf (1.336Mb)

Permanent lenke

http://hdl.handle.net/11250/253029

Utgivelsesdato

2012

Metadata

Vis full innførsel

Samlinger

Institutt for datateknologi og informatikk [6822]

Sammendrag

In previous work, a cache-aware sparse matrix multiplication for linear programming interior point methods was proposed. The serial implementations achieved speedups ranging from 1.2 to 108.0 over the implementation in GLPK, an open-source linear programming solver. In this work, the same ideas and data structures are used to develop a cache-aware sparse cholesky decomposition as it is implemented in GLPK. The serial implementation achieves a speedup of up to 2.5 on the problem set considered. The matrix multiplication and cholesky decomposition are analysed by use of performance counters on both an AMD-based and an Intel-based system. The analysis shows that the applied blocking techniques reduce the number of floating point operations performed, and that this effect is even more important than the achieved cache utilization to produce speedup for some problems.

Utgiver

Institutt for datateknikk og informasjonsvitenskap