Posts by Jiahui Cao
FP8 GEMM Optimization on AMD CDNA™4 Architecture
- 10 March 2026
This blog post continues our previous blog Matrix Core Programming on AMD CDNA™3 and CDNA™4 Architecture, which introduced Matrix Cores and demonstrated how to use them in HIP kernels.