HPC Blogs

HPC Blogs#

Unlocking GPU-Accelerated Containers with the AMD Container Toolkit

Simplify GPU acceleration in containers with the AMD Container Toolkit—streamlined setup, runtime hooks, and full ROCm integration.

July 03, 2025 by Abhishek Patil

Performance Profiling on AMD GPUs – Part 1: Foundations

Part 1 of our GPU profiling series introduces ROCm tools, setup steps, and key concepts to prepare you for deeper dives in the posts to follow.

June 26, 2025 by Gina Sitaraman, Thomas Gibson, Luka Stanisic, Giacomo Capodaglio, Alessandro Fanfarillo, Asitav Mishra

AMD ROCm: Powering the World's Fastest Supercomputers

Discover how ROCm drives the world’s top supercomputers, from El Capitan to Frontier, and why its shaping the future of scalable, open and sustainable HPC

June 10, 2025 by Mohammed Faraaz Mustafa, Saad Rahim

LLM Quantization with Quark on AMD GPUs: Accuracy and Performance Evaluation

Learn how to use Quark to apply FP8 quantization to LLMs on AMD GPUs, and evaluate accuracy and performance using vLLM and SGLang on AMD MI300X GPUs.

June 09, 2025 by Sean Song

Ecosystems & Partners

The ROCm Revisited Series

We present our ROCm Revisited Series. Discover ROCm's role in leading edge supercomputing, its growing ecosystem-from HIP, to developer tools-powering AI, HPC, and data science across multi-GPU and cluster systems

June 06, 2025 by Mohammed Faraaz Mustafa, Liam Berry, Saad Rahim

ROCm Revisited: Getting Started with HIP

New to HIP? This blog will introduce you to the HIP runtime API, its key concepts and installation and practical code examples to showcase its functionality.

June 06, 2025 by Liam Berry, Mohammed Faraaz Mustafa, Saad Rahim

ROCm Revisited: Evolution of the High-Performance GPU Computing Ecosystem

Learn how ROCm evolved to support HPC, AI, and containerized workloads with modern tools, libraries, and deployment options.

June 06, 2025 by Liam Berry, Saad Rahim

HIP 7.0 Is Coming: What You Need to Know to Stay Ahead

Get ready for HIP 7.0—explore key API changes that boost CUDA compatibility and streamline portable GPU development, start preparing your code today.

May 28, 2025 by Christophe Paquot, Julia Jiang, Denny Iriawan, Saad Rahim

Applications & Models

Seismic stencil codes - part 1

Seismic Stencil Codes - Part 1: Seismic workloads in the HPC space have a long history of being powered by high-order finite difference methods on structured grids. This trend continues to this day.

August 29, 2024 by Justin Chang, Ossian O'Reilly

Seismic stencil codes - part 2

Seismic Stencil Codes - Part 2: In the previous post, recall that the kernel with stencil computation in the z-direction suffered from low effective bandwidth. This low performance comes from generating substantial amounts of data to movement to global memory.

August 29, 2024 by Justin Chang, Ossian O'Reilly

Seismic stencil codes - part 3

Seismic Stencil Codes - Part 3: In the last two blog posts, we developed a HIP kernel capable of computing high order finite differences commonly needed in seismic wave propagation.

August 29, 2024 by Justin Chang, Ossian O'Reilly