Posts by Mikko Tukiainen
Scaling AI Inference Performance with vLLM on AMD Instinct MI355X GPUs
- 08 December 2025
Today, we are excited to share Large Language Model (LLM) Inference Performance with vLLM on AMD Instinctâ„¢ MI355X GPUs. Whether you are a startup, an enterprise or a hyperscaler, the AMD open software ecosystem with Instinct MI355X GPUs delivers consistent, high-performance inference at scale outperforming Nvidia Blackwell B200 GPUs as concurrency grows. For real-world users, this performance impact is directly proportional to user experience and cost efficiency in production environments.