Lecture 8
Learning objectives
After this class, you should be able to:
- Describe the following CUDA feature:
cudaMemcpyToSymbol
.- Use the above feature to write code that uses constant memory efficiently.
Reading assignment
- Read UIUC Lecture 8 slides.
- Chapter 8 of text, until (and including) section 8.3.
Exercises and review questions
- Exercises and review questions on current lecture's material
- Implement the 1-D convolution kernel and compare the performance with and without shared memory.
- Consider performing 1-D convolution on array
{4, 1, 3, 2, 3}
with with mask{2, 1, 4}
. What is the resulting output array?- How many halo cells are there when we perform 2-D convolution on an
n x n
matrix with anm x m
mask?- Preparation for the next lecture
- How can the algorithm discussed in class be improved?.
Last modified: 31 Jan 2013