Advanced algorithmic techniques for GPUs
University of Illinois, , Prof. Wen-mei Hwu
Updated On 02 Feb, 19
University of Illinois, , Prof. Wen-mei Hwu
Updated On 02 Feb, 19
4.1 ( 11 )
The GPU architecture is characterized by memory access bandwidth that, although fast, is often limiting in comparison to compute throughput. Thus, achieving performance critically depends on finding ways to reduce and regularize global memory access. Three important algorithmic strategies for conserving bandwidth are �registermemory tiling�, �layout transformation� and �thread coarsening�. These come at a cost of increased on-chip memory usage, which is also a limited resource. We will discuss a variety of examples from PDE solvers, linear algebra, and convolution.
Sam
Sep 12, 2018
Excellent course helped me understand topic that i couldn't while attendinfg my college.
Dembe
March 29, 2019
Great course. Thank you very much.