Mini Projects

Each mini project has a clear set of goals. Explore and understand different concepts practically while working towards achieving those goals.

Mini Projects
Photo by Erik Mclean / Unsplash

xGeMM

GitHub - tgautam03/xGeMM: Accelerated General (FP32) Matrix Multiplication
Accelerated General (FP32) Matrix Multiplication. Contribute to tgautam03/xGeMM development by creating an account on GitHub.

Goal: Code matrix multiplication from scratch and (try to) match the performance of cuBLAS SGeMM.

Content Index

  1. Custom Data Structure: MatrixFP32 (and utility function)
  2. Naive CPU implementation vs BLAS level implementation (using Eigen)