Posts tagged Linear Algebra

SmoothQuant model inference on AMD Instinct MI300X using Composable Kernel

The AMD ROCm™ Composable Kernel (CK) library provides a programming model for writing performance-critical kernels for machine learning workloads. It uses C++ templates to generate general-purpose kernels at compile time, enabling developers to fuse operations across different data precisions.


Sparse matrix vector multiplication - part 1

Note: This blog was previously part of the AMD lab notes blog series.


Jacobi Solver with HIP and OpenMP offloading

Note: This blog was previously part of the AMD lab notes blog series.


AMD matrix cores

Note: This blog was previously part of the AMD lab notes blog series.
