Lecture 17

Learning objectives

After this class, you should be able to:

  1. Optimize parallel code using double buffering.
  2. Optimize large loops to reduce instruction cache misses.
  3. Use domain decomposition techniques to distribute data among the processes or threads in a parallel computation.
  4. Use dynamic load balancing techniques to assign roughly equal work to each process or thread in a parallel computation.

Reading assignment

  1. Read the paper on graph partitioning under the 'Course Library' tab on Blackboard.

Exercises and review questions


Last modified: 17 Mar 2010