Recent Posts - Page 2#

Fine-tuning Robotics Vision Language Action Models with AMD ROCm and LeRobot

Speed up robotics AI with AMD ROCm and LeRobot: fine-tune VLAs on Instinct GPUs and deploy on Ryzen AI. Follow the tutorial to get started.

July 14, 2025 by Abby O'Neill, Sarunas Kalade, Ken O'Brien, Graham Schelle

Accelerating Video Generation on ROCm with Unified Sequence Parallelism: A Practical Guide

A practical guide for accelerating video generation with HunyuanVideo and Wan 2.1 using Unified Sequence Parallelism on AMD GPUs.

July 11, 2025 by Clint Greene

Nitro-T: Training a Text-to-Image Diffusion Model from Scratch in 1 Day

Nitro-T is a family of text-to-image diffusion models developed by AMD to demonstrate efficient large-scale training on Instinct™ MI300X GPUs. Trained from scratch in under 24 hours

July 09, 2025 by Akash Haridas, Tong Shen, Jingai Yu

vLLM V1 Meets AMD Instinct GPUs: A New Era for LLM Inference Performance

vLLM v1 on AMD ROCm boosts LLM serving with faster TTFT, higher throughput, and optimized multimodal support—ready out of the box.

July 07, 2025 by Seungrok Jung, Hyukjoon Lee, Andy Luo

Unlocking GPU-Accelerated Containers with the AMD Container Toolkit

Simplify GPU acceleration in containers with the AMD Container Toolkit—streamlined setup, runtime hooks, and full ROCm integration.

July 03, 2025 by Abhishek Patil

Accelerated LLM Inference on AMD Instinct™ GPUs with vLLM 0.9.x and ROCm

vLLM v0.9.x is here with major ROCm™ optimizations—boosting LLM performance, reducing latency, and expanding model support on AMD Instinct™ GPUs.

June 28, 2025 by Hongxia Yang, Peng Sun, Tun Jian Tan, Pin Siang Tan, Anshul Gupta

Performance Profiling on AMD GPUs – Part 1: Foundations

Part 1 of our GPU profiling series introduces ROCm tools, setup steps, and key concepts to prepare you for deeper dives in the posts to follow.

June 26, 2025 by Gina Sitaraman, Thomas Gibson, Luka Stanisic, Giacomo Capodaglio, Alessandro Fanfarillo, Asitav Mishra

Enabling Real-Time Context for LLMs: Model Context Protocol (MCP) on AMD GPUs

Learn how to leverage Model Context Protocol (MCP) servers to provide real time context information to LLMs through a chatbot example on AMD GPUs

June 20, 2025 by Fabricio Flores

Continued Pretraining: A Practical Playbook for Language-Specific LLM Adaptation

A step by step guide to adapting LLMs to new languages via continued pretraining, with Poro 2 boosting Finnish performance using Llama 3.1 and AMD GPUs

June 18, 2025 by Elaine Zosa, Jouni Luoma, Kai Hakala, Antti Virtanen, Mika Koistinen, Jonathan Burdge

Fine-Tuning LLMs with GRPO on AMD MI300X: Scalable RLHF with Hugging Face TRL and ROCm

Fine-tune LLMs with GRPO on AMD MI300X—leverage ROCm, Hugging Face TRL, and vLLM for efficient reasoning and scalable RLHF

June 18, 2025 by Zhu Shan, George Wang

Aligning Mixtral 8x7B with TRL on AMD GPUs

This blog demonstrates how to fine-tune and align Mixtral 8x7B with TRL using DPO and evaluate it on AMD GPUs.

June 12, 2025 by Clint Greene

Introducing Instella-Long: A Fully Open Language Model with Long-Context Capability

Learn about Instella-Long: AMD’s open 3B language model supporting 128K context, trained on MI300X GPUs, outperforming peers on long-context benchmarks.

June 11, 2025 by Jialian Wu, Jiang Liu, Sudhanshu Ranjan, Xiaodong Yu, Gowtham Ramesh, Prakamya Mishra, Zicheng Liu, Yusheng Su, Ximeng Sun, Ze Wang, Emad Barsoum

Prev Page 2 of 18 Next