Posts by Stig-Arne Gronroos

Scaling AI Inference Performance with vLLM on AMD Instinct MI355X GPUs

Today, we are excited to share large language model (LLM) inference performance results for vLLM on AMD Instinct™ MI355X GPUs. Whether you are a startup, an enterprise, or a hyperscaler, the AMD open software ecosystem with Instinct MI355X GPUs delivers consistent, high-performance inference at scale, outperforming Nvidia Blackwell B200 GPUs as concurrency grows. For real-world users, this performance translates directly into better user experience and cost efficiency in production environments.
