Advanced algorithmic techniques for GPUs

University of Illinois, Prof. Wen-mei Hwu

Updated On 02 Feb, 19

Overview

Includes

Lecture 2: Parallelism transformations for performance


Lecture Details

Often domain problems have inherent parallelism that needs to be recognized. The most efficient implementation that exploits the problem's parallelism may be non-intuitive. For example, two alternative thread arrangements that appear in electrostatics calculations have, respectively, scatter and gather memory access behavior. The first is more intuitive, but the second is much more efficient on the GPU architecture.
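
The two arrangements can be illustrated with a minimal serial C++ sketch. It is not the lecture's code: the `Atom` struct, the function names `potential_scatter` and `potential_gather`, and the 2D grid slice at z = 0 are all hypothetical choices made for illustration, and atoms are assumed never to coincide with a grid point. In the scatter version the outer loop runs over atoms, so a GPU mapping with one thread per atom would have many threads writing to the same grid cell and would likely need atomic updates. In the gather version the outer loop runs over grid points, so each thread would own exactly one output cell and only read the shared atom data.

```cpp
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical atom record for illustration: position plus charge.
struct Atom { float x, y, z, charge; };

// Scatter: one unit of work per atom. Each atom adds its contribution
// to every grid point, so concurrent workers would collide on writes.
std::vector<float> potential_scatter(const std::vector<Atom>& atoms,
                                     std::size_t nx, std::size_t ny,
                                     float spacing) {
    std::vector<float> grid(nx * ny, 0.0f);
    for (const Atom& a : atoms) {                 // would be one thread per atom
        for (std::size_t j = 0; j < ny; ++j) {
            for (std::size_t i = 0; i < nx; ++i) {
                float dx = static_cast<float>(i) * spacing - a.x;
                float dy = static_cast<float>(j) * spacing - a.y;
                float r = std::sqrt(dx * dx + dy * dy + a.z * a.z);
                grid[j * nx + i] += a.charge / r; // shared write location
            }
        }
    }
    return grid;
}

// Gather: one unit of work per grid point. Each point reads all atoms
// and accumulates privately, then performs a single write it owns.
std::vector<float> potential_gather(const std::vector<Atom>& atoms,
                                    std::size_t nx, std::size_t ny,
                                    float spacing) {
    std::vector<float> grid(nx * ny, 0.0f);
    for (std::size_t j = 0; j < ny; ++j) {
        for (std::size_t i = 0; i < nx; ++i) {    // would be one thread per point
            float sum = 0.0f;                     // private accumulator
            for (const Atom& a : atoms) {
                float dx = static_cast<float>(i) * spacing - a.x;
                float dy = static_cast<float>(j) * spacing - a.y;
                sum += a.charge / std::sqrt(dx * dx + dy * dy + a.z * a.z);
            }
            grid[j * nx + i] = sum;               // exactly one writer per cell
        }
    }
    return grid;
}
```

Both functions compute the same potential map; only the ownership of the output differs. That difference is what makes the gather arrangement the better fit when the outer loop is spread across GPU threads.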

Comments
Sam

Excellent course; it helped me understand a topic that I couldn't while attending my college.

Dembe

Great course. Thank you very much.
