The core course for NVIDIA candidates: the CUDA programming model, the memory hierarchy, memory coalescing, writing kernels, profiling, and matrix multiplication optimization.