Browsing Fakultet for informasjonsteknologi og elektroteknikk (IE) by Journals "ACMSIGPLAN Symposium on Principles and Practice of Parallel Programming"
Now showing items 1-2 of 2
-
Register-based Implementation of the Sparse General Matrix-matrix Multiplication on GPUs
(Journal article; Peer reviewed, 2018)General sparse matrix-matrix multiplication (SpGEMM) is an essential building block in a number of applications. In our work, we fully utilize GPU registers and shared memory to implement an efficient and load balanced ... -
swSpTRSV: A Fast Sparse Triangular Solve with Sparse Level Tile Layout on Sunway Architectures
(Journal article; Peer reviewed, 2018)Sparse triangular solve (SpTRSV) is one of the most important kernels in many real-world applications. Currently, much research on parallel SpTRSV focuses on level-set construction for reducing the number of inter-level ...