[Study] CUDA Kernel Optimization — Memory Access Patterns
While studying deep learning inference optimization, I explored how memory access patterns in CUDA kernels dramatically affect performance.