Searched for: subject%3A%22caching%22
(1 - 1 of 1)
document
Xu, S. (author), Xue, W. (author), Lin, H.X. (author)
In this article, we discuss the performance modeling and optimization of Sparse Matrix-Vector Multiplication (SpMV) on NVIDIA GPUs using CUDA. SpMV has a very low computation-data ratio and its performance is mainly bound by the memory bandwidth. We propose optimization of SpMV based on ELLPACK from two aspects: (1) enhanced performance for the...
journal article 2011