Posts by Mikko Tukiainen

Scaling AI Inference Performance with vLLM on AMD Instinct MI355X GPUs

Today, we are excited to share Large Language Model (LLM) inference performance results with vLLM on AMD Instinct™ MI355X GPUs. Whether you are a startup, an enterprise, or a hyperscaler, the AMD open software ecosystem with Instinct MI355X GPUs delivers consistent, high-performance inference at scale, outperforming NVIDIA Blackwell B200 GPUs as concurrency grows. For real-world users, this performance translates directly into better user experience and cost efficiency in production environments.
