Posts by Jiaxin Wang
Accelerating IBM Granite 4.0 with FP8 using AMD Quark on MI300/MI355 GPUs
- 09 January 2026
In this post, we demonstrate how AMD Quark, a high-performance quantization library optimized for AMD Instinctâ„¢ MI300 and MI355 GPUs, enables FP8 quantization to deliver excellent accuracy retention and substantial throughput uplift for the IBM Granite 4.0 model family. For instructions on deploying Granite 4.0 on AMD GPUs, please refer to the previous blog post.