AI Blogs - Page 11
Reproducing the AMD Instinct™ GPUs MLPerf Inference v5.0 Submission
A step-by-step guide to reproducing AMD’s MLPerf v5.0 results for Llama 2 70B & SDXL using ROCm on MI325X
Bring FLUX to Life on MI300X: Run and Optimize with Hugging Face Diffusers
This blog walks you through the FLUX text-to-image diffusion model architecture and shows you how to run and optimize it on MI300X. A minimal sketch of what that looks like follows.
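As a taste of what the post covers, here is a minimal sketch of running FLUX.1 [dev] through Diffusers' `FluxPipeline`; the prompt and generation parameters are illustrative defaults, not the blog's tuned settings.

```python
# Minimal FLUX text-to-image run with Hugging Face Diffusers
# (illustrative defaults; the blog's tuned settings may differ).
import torch
from diffusers import FluxPipeline

# bfloat16 keeps the ~12B-parameter FLUX.1 [dev] within a single GPU's memory.
pipe = FluxPipeline.from_pretrained(
    "black-forest-labs/FLUX.1-dev", torch_dtype=torch.bfloat16
)
pipe.to("cuda")  # ROCm builds of PyTorch expose AMD GPUs via the "cuda" device

image = pipe(
    "A photorealistic astronaut relaxing on a tropical beach",
    num_inference_steps=28,
    guidance_scale=3.5,
).images[0]
image.save("flux_mi300x.png")
```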
Accelerating LLM Inference: Up to 3x Speedup on MI300X with Speculative Decoding
This blog demonstrates the out-of-the-box performance improvements speculative decoding delivers for LLM inference on MI300X.
Speculative Decoding - Deep Dive
This blog shows the performance improvements achieved by applying speculative decoding to Llama models on AMD MI300X GPUs, tested across model sizes, input lengths, and datasets. A minimal sketch of the setup follows.
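For readers who want to try it immediately, here is a hedged vLLM sketch pairing a large target model with a small draft model. The keyword arguments shown (`speculative_model`, `num_speculative_tokens`) match older vLLM releases and should be treated as assumptions; newer versions move them into a `speculative_config` dict.

```python
# Sketch: target + draft model speculative decoding in vLLM.
# Keyword names follow older vLLM releases; newer versions use speculative_config.
from vllm import LLM, SamplingParams

llm = LLM(
    model="meta-llama/Llama-2-70b-chat-hf",             # large target model
    speculative_model="meta-llama/Llama-2-7b-chat-hf",  # small draft model
    num_speculative_tokens=5,  # draft tokens proposed per verification step
    tensor_parallel_size=8,
)
outputs = llm.generate(
    ["Explain speculative decoding in two sentences."],
    SamplingParams(temperature=0.0, max_tokens=128),
)
print(outputs[0].outputs[0].text)
```

The draft model proposes several tokens cheaply; the target model verifies them in a single forward pass, so accepted drafts amortize the cost of the large model without changing its output distribution.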
Efficient MoE training on AMD ROCm: How to use MegaBlocks on AMD GPUs
Learn how to use MegaBlocks to pre-train a GPT-2 Mixture of Experts (MoE) model, helping you scale your deep learning models efficiently on AMD GPUs using ROCm
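As a flavor of the API, here is a hedged sketch of a standalone MegaBlocks dropless-MoE layer; the `Arguments` fields and the forward-pass return signature are assumptions based on the `megablocks.layers` modules and may differ across releases.

```python
# Hedged sketch: one dropless-MoE (dMoE) layer from MegaBlocks.
# Arguments fields and the return signature are assumptions; check your release.
import torch
from megablocks.layers.arguments import Arguments
from megablocks.layers.dmoe import dMoE

args = Arguments(
    hidden_size=1024,
    ffn_hidden_size=4096,
    moe_num_experts=8,  # experts in this layer
    moe_top_k=2,        # route each token to its top-2 experts
)
layer = dMoE(args).to(device="cuda", dtype=torch.bfloat16)

tokens = torch.randn(8, 512, 1024, device="cuda", dtype=torch.bfloat16)
out = layer(tokens)  # some releases also return an auxiliary load-balancing loss
```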
Supercharge DeepSeek-R1 Inference on AMD Instinct MI300X
Learn how to optimize DeepSeek-R1 on AMD MI300X with SGLang, AITER kernels, and hyperparameter tuning for up to 5× higher throughput and 60% lower latency than Nvidia H200
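A minimal offline-inference sketch with SGLang's Python `Engine` follows; the `tp_size` value and sampling parameters are illustrative assumptions, and the blog itself covers the full AITER and tuning recipe.

```python
# Sketch: offline DeepSeek-R1 inference with SGLang's Engine API
# (tp_size and sampling params are illustrative, not the blog's tuning).
import sglang as sgl

engine = sgl.Engine(
    model_path="deepseek-ai/DeepSeek-R1",
    tp_size=8,  # shard the 671B-parameter MoE across 8 MI300X GPUs
)
out = engine.generate(
    "Prove that the square root of 2 is irrational.",
    {"temperature": 0.6, "max_new_tokens": 512},
)
print(out["text"])
engine.shutdown()
```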
AITER: AI Tensor Engine For ROCm
We introduce AMD's AI Tensor Engine for ROCm (AITER), our centralized repository of high-performance AI operators, designed to significantly accelerate AI workloads on AMD GPUs
Deploying Google’s Gemma 3 Model with vLLM on AMD Instinct™ MI300X GPUs: A Step-by-Step Guide
AMD is excited to announce the integration of Google’s Gemma 3 models with AMD Instinct™ MI300X GPUs
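A minimal offline-inference sketch with vLLM's Python API follows; the model id is Google's instruction-tuned Hugging Face release, and nothing here is specific to the blog's configuration (Gemma 3 support requires a recent vLLM build).

```python
# Sketch: offline Gemma 3 inference with vLLM on a single MI300X.
from vllm import LLM, SamplingParams

# The 27B instruction-tuned checkpoint fits in one MI300X's 192 GB at bf16.
llm = LLM(model="google/gemma-3-27b-it")
outputs = llm.generate(
    ["Summarize the key features of the Gemma 3 model family."],
    SamplingParams(temperature=0.7, max_tokens=256),
)
print(outputs[0].outputs[0].text)
```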
Analyzing the Impact of Tensor Parallelism Configurations on LLM Inference Performance
This blog analyzes how tensor parallelism configurations impact total cost of ownership (TCO) and scalability for production LLM deployments; a sketch of such a sweep follows.
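To make the trade-off concrete, here is a hypothetical harness that sweeps tensor-parallel sizes and reports aggregate versus per-GPU throughput; it is not the blog's benchmark, and the model id is a placeholder.

```python
# Hypothetical harness: compare aggregate vs. per-GPU throughput across
# tensor-parallel sizes (not the blog's benchmark; in practice run each
# TP size in a fresh process so GPU memory is fully released).
import time
from vllm import LLM, SamplingParams

PROMPTS = ["Summarize the trade-offs of tensor parallelism."] * 64
PARAMS = SamplingParams(temperature=0.0, max_tokens=128)

def measure(tp_size: int) -> None:
    llm = LLM(model="meta-llama/Llama-3.1-70B-Instruct",
              tensor_parallel_size=tp_size)
    start = time.perf_counter()
    llm.generate(PROMPTS, PARAMS)
    elapsed = time.perf_counter() - start
    total = len(PROMPTS) / elapsed
    print(f"TP={tp_size}: {total:.2f} req/s total, "
          f"{total / tp_size:.2f} req/s per GPU")

if __name__ == "__main__":
    measure(8)  # repeat with tp_size 1, 2, 4 in separate runs
```

Larger TP sizes cut per-request latency but add inter-GPU communication, so per-GPU throughput (and hence TCO) often favors the smallest TP size that fits the model.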
AI Inference Orchestration with Kubernetes on Instinct MI300X, Part 3
This blog is part 3 of a series aimed at providing a comprehensive, step-by-step guide for deploying and scaling AI inference workloads with Kubernetes and the AMD GPU Operator on the AMD Instinct platform
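As a flavor of what the series builds toward, here is a hedged sketch that schedules a pod onto an Instinct node by requesting the `amd.com/gpu` resource the AMD GPU Operator advertises, using the official Kubernetes Python client; the image and pod names are placeholders, not from the blog.

```python
# Hedged sketch: request one AMD GPU via the amd.com/gpu resource that the
# AMD GPU Operator exposes. Image and names are placeholders.
from kubernetes import client, config

config.load_kube_config()  # or load_incluster_config() inside the cluster

pod = client.V1Pod(
    metadata=client.V1ObjectMeta(name="rocm-smoke-test"),
    spec=client.V1PodSpec(
        restart_policy="Never",
        containers=[client.V1Container(
            name="rocm",
            image="rocm/pytorch:latest",  # placeholder image
            command=["rocm-smi"],         # print GPU status and exit
            resources=client.V1ResourceRequirements(
                limits={"amd.com/gpu": "1"}  # one Instinct GPU
            ),
        )],
    ),
)
client.CoreV1Api().create_namespaced_pod(namespace="default", body=pod)
```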
Optimized ROCm Docker for Distributed AI Training
AMD's updated Docker images incorporate torchtune fine-tuning, FP8 support, a single-node performance boost, bug fixes, and updated benchmarking for stable, efficient distributed training
AMD Advances Enterprise AI Through OPEA Integration
We announce AMD's support for the Open Platform for Enterprise AI (OPEA), integrating OPEA's enterprise GenAI framework with AMD's compute hardware and ROCm software