Lecture 25
Learning objectives
After this class, you should be able to:
- Write simple code that combines MPI and CUDA, and compile and run it on XSEDE machines.
Reading assignment
- Lec25 example code.
- UIUC Lecture 27.
- Chapter 19 of text.
Exercises and review questions
- Exercises and review questions on current lecture's material
- Modify the
vec-add
program so that it uses zero-copy. Compare its performance against that of the code without zero-copy and report it on the discussion board. If you don't report the results of this task, then you will need to combine CUDA and MPI in your project.- Preparation for the next lecture
- What is the OpenCL equivalent of CUDA's
threadIdx.x
?- What is the OpenCL equivalent of CUDA's
blockDim.x
?
Last modified: 11 Apr 2013