GEMM Optimization

This repo contains a GEMM implementation for the M1 Pro, written with Arm NEON intrinsics.

Usage

    cmake -B build
    cmake --build build -j
    ./build/gemm-kernel-benchmark

Citation