Featured Posts
AMD Enterprise AI Suite: Open Infrastructure for Production AI
Explore an open, GPU-optimized platform to build, deploy, and scale enterprise AI workloads on AMD Instinct with production-ready performance.
Technical Dive into AMD MLPerf Training v5.1 Submission
Learn about the technical details of how AMD achieved the results in the MLPerf Training v5.1 submission.
ROCm 7.9 Technology Preview: ROCm Core SDK and TheRock Build System
Introduce ROCm Core SDK, and learn to install and build ROCm components easily using TheRock.
Matrix Core Programming on AMD CDNA™3 and CDNA™4 architecture
This blog post explains how to use Matrix Cores on CDNA3 and CDNA4 architecture, with a focus on low-precision data types such as FP16, FP8, and FP4
Inference with HunyuanWorld-Voyager on AMD Instinct GPUs
Learn how to run inference with HunyuanWorld-Voyager, a state-of-the-art 3D world model, on AMD Instinct MI300X GPUs using ROCm for efficient video generation and prediction.
Accelerating AI-Driven Crystalline Materials Design with MatterGen on AMD Instinct MI300X
Learn how to run a generative model for inorganic material design with MatterGen on MI300X.
AMD Inference Microservice (AIM): Production Ready Inference on AMD Instinct™ GPUs
Learn how AIM delivers efficient, scalable inference on AMD Instinct GPUs and see how it simplifies deployment, optimization, and operations.
Plug-and-Play CuPy on ROCm: Data Analytics Acceleration Made Simple
Learn about how to enhance your analytics project with the latest AMD CuPy release.
Democratizing AI Compute with AMD Using SkyPilot
Learn how SkyPilot integrates with AMD open AI stack to enable seamless multi-cloud deployment and simplifies NVIDIA-to-AMD GPU migration.
Continuing the Momentum: Refining ROCm For The Next Wave Of AI and HPC
ROCm 7.1 builds on 7.0’s AI and HPC advances with faster performance, stronger reliability, and streamlined tools for developers and system builders.
ROCm 7.0: An AI-Ready Powerhouse for Performance, Efficiency, and Productivity
Discover how ROCm 7.0 integrates AI across every layer, combining hardware enablement, frameworks, model support, and a suite of optimized tools
Llama.cpp Meets Instinct: A New Era of Open-Source AI Acceleration
performance optimizations for llama.cpp on AMD Instinct GPUs
Reproducing AMD MLPerf Training v5.1 Submission Result
Learn how to reproduce AMD's MLPerf Training v5.1 submission result.
Training AI Weather Forecasting Models on AMD Instinct
Learn how deterministic and generative AI models for synoptic-scale weather forecasting are trained efficiently on AMD Instinct MI300X GPUs using the ROCm and GeoArches tools.
Day 0 Developer Guide: hipBLASLt Offline GEMM Tuning Script
Learn how to improve model performance with hipBLASLt offline tuning in our easy-to-use Day 0 tool for developers to optimize GEMM efficiency
Retrieval Augmented Generation (RAG) with vLLM, LangChain and Chroma
Learn AI-powered knowledge retrieval that enriches prompts with proprietary data to deliver accurate and context-aware answers
Accelerating Vector Search: hipVS and hipRAFT on AMD
Learn how hipVS accelerates vector search on AMD Instinct GPUs, with notebook demos for semantic search, RAG, and recommendation systems.
Practical, Fault‑Robust Distributed Inference for DeepSeek on AMD MI300X
Learn how a small-radius expert parallel design with prefill–decode disaggregation enables scalable, fault-isolated LLM inference on AMD Instinct™ MI300X clusters.
Stability at Scale: AMD’s Full‑Stack Platform for Large‑Model Training
Primus streamlines LLM training on AMD GPUs with unified configs, multi-backend support, preflight validation, and structured logging.
High-Accuracy MXFP4, MXFP6, and Mixed-Precision Models on AMD GPUs
Learn to leverage AMD Quark for efficient MXFP4/MXFP6 quantization on AMD Instinct accelerators with high accuracy retention.
Stay informed
- Subscribe to our RSS feed (Requires an RSS reader available as browser plugins.)
- Signup for the ROCm newsletter
- View our blog statistics
- View the ROCm Developer Hub
- Report an issue or request a feature
- We are eager to learn from our community! If you would like to contribute to the ROCm Blogs, please submit your technical blog for review at our GitHub. Blog creation can be started through our GitHub user guide.