Lecture 18
Learning objectives
After this class, you should be able to:
- Explain the performance implications of atomic operations.
- Use shared memory to reduce the overhead of atomic operations.
Reading assignment
- UIUC Lecture 19.
Exercises and review questions
- Exercises and review questions on current lecture's material
- (i) Write code to perform reduction using atomic operations on global memory. (ii) Rewrite this code to use shared memory in order to reduce the overhead of atomic operations. (iii) Compare the performances of the above two with the reduction algorithm we had discussed earlier. Report your performance results on the discussion board.
- Write code with different threads performing atomic operations on different, but adjacent, locations in memory. Compare the performance with different data sizes. Report your performance results on the discussion board.
- Preparation for the next lecture
- How many lanes are there in the PCIe that connects a GPU to the host on gpu.cs.fsu.edu?
Last modified: 14 Mar 2013