Posts by YangWen Huang
GEMM Tuning within hipBLASLt– Part 2
- 09 October 2025
This post continues from Part 1 where we introduced GEMM tuning concepts in hipBLASLt and explored the basics of solution search. In Part 2, we focus on offline tuning with the hipblaslt-bench tool. This workflow allows developers to benchmark candidate GEMM kernels for specific problem shapes, capture the best-performing solutions, and reuse them at runtime without rebuilding or modifying the hipBLASLt library.
GEMM Tuning within hipBLASLt - Part 1
- 05 September 2025
When optimizing matrix operations on AMD GPUs using the ROCm platform, tuning specific problem sizes is essential for achieving maximum performance. The hipBLASLt
library supports two official tuning mechanisms: