Lecture 13
Learning objectives
After this class, you should be able to:
- Use compiler flags, standard library functions, and SIMD instrinsics to optimize your code.
Reading assignment
- Lecture 13 slides.
- Article on memory optimization.
- Lecture 13 example codes.
Exercises and review questions
- Questions on current lecture's material
- Run the lecture 13 example codes on
gpu
and read the codes.- Search the following assembly codes:
Lec13/MM.s
andLec13/MM_sse.s
for the SSE SIMD instruction for single precision multiplication:mulps
. Explain why the first code has worse performance than the second one. (Note: You can disassemble executables on Linux usingobjdump -d
, in case you wish to perform similar analysis on other executables.)- Questions on next lecture's material
- None.