Tushar Gautam - 0Mean1Sigma (Page 2)

28 Nov 2024 8 min read

What is SGeMM

SGeMM stands for Single-Precision General Matrix Multiplication. Let's analyze matrix multiplication on a CPU and a GPU.

28 Nov 2024 12 min read

Parallel matrix multiplication using CUDA C++.

28 Nov 2024 11 min read

Memory coalescing is the most crucial concept in GPU programming. With matrix multiplication, we can get upwards of 7x improvement.

28 Nov 2024 11 min read

Tiled matrix multiplication using GPU shared memory.

28 Nov 2024 8 min read

Thread registers are used to increase the performance of matrix multiplication by another 4x.