Contents

AMD ROCm™ Blogs

Ecosystems and partners

Stone Ridge Expands Reservoir Simulation Options with AMD Instinct™ Accelerators

Stone Ridge Technology latest development effort was to port ECHELON from CUDA to the AMD HIP platform, enabling ECHELON to use AMD Instinct GPUs like the MI210, MI250X, and the upcoming MI300 Series

Stone Ridge Expands Reservoir Simulation Options with AMD Instinct™ Accelerators
University of Michigan, AMD collaboration

AMD Collaboration with the University of Michigan offers High Performance Open-Source Solutions to the Bioinformatics Community

AMD Collaboration with the University of Michigan offers High Performance Open-Source Solutions to the Bioinformatics Community
Siemens and AMD partnership

Siemens taps AMD Instinct™ GPUs to expand high-performance hardware options for Simcenter STAR-CCM+

Siemens taps AMD Instinct™ GPUs to expand high-performance hardware options for Simcenter STAR-CCM+

Applications and models

Accelerating models on ROCm using PyTorch TunableOp

In this blog, we will show how to leverage PyTorch TunableOp to accelerate models using ROCm on AMD GPUs.

Accelerating models on ROCm using PyTorch TunableOp
A Guide to Implementing and Training Generative Pre-trained Transformers (GPT) in JAX on AMD GPUs

In this blog, we illustrate the process of implementing and training a Generative Pre-trained Transformer (GPT) model in JAX.

A Guide to Implementing and Training Generative Pre-trained Transformers (GPT) in JAX on AMD GPUs
Mamba on AMD GPUs with ROCm

In this blog, we delve into the Mamba architecture and demonstrate how to use Mamba on AMD GPUs with the ROCm platform.

Mamba on AMD GPUs with ROCm
Deep Learning Recommendation Model on AMD GPU

In this blog, we demonstrate how to build a simple Deep Learning Recommendation Model (DLRM) with PyTorch on a ROCm-capable AMD GPU.

Deep Learning Recommendation Models on AMD GPUs
Segment Anything with AMD GPUs

The Segment Anything Model (SAM) is a cutting-edge image segmentation model that democratizes promptable segmentation.

./artificial-intelligence/segment-anything/README.html
Detectron2 on AMD GPUs

Panoptic segmentation and instance segmentation with Detectron2 on AMD GPUs.

Panoptic segmentation and instance segmentation with Detectron2 on AMD GPUs

Software tools & optimizations

SmoothQuant model inference on AMD Instinct MI300X using Composable Kernel

The AMD ROCm™ Composable Kernel (CK) library provides a programming model for writing performance-critical kernels for machine learning workloads.

./software-tools-optimization/ck-int8-gemm-sq/README.html
AMD in Action: Unveiling the Power of Application Tracing and Profiling

Rocprof is a robust tool designed to analyze and optimize the performance of HIP programs on AMD ROCm platforms

AMD in Action: Unveiling the Power of Application Tracing and Profiling
Reading AMD GPU ISA

In this blog post, we will discuss how to read and understand the ISA for AMD’s Graphics Core Next (AMDGCN) architecture

Reading AMD GPU ISA
Application portability with HIP

HIP enables these High-Performance Computing (HPC) facilities to transition their CUDA codes to run and take advantage of the latest AMD GPUs

Application portability with HIP
C++17 parallel algorithms and HIPSTDPAR

The C++17 standard added the concept of parallel algorithms to the pre-existing C++ Standard Library

C++17 parallel algorithms and HIPSTDPAR
Affinity part 1 - Affinity, placement, and order

Affinity is a way for processes to indicate preference of hardware components so that a given process is always scheduled to the same set of compute cores and is able to access data from local memory efficiently

Affinity part 1 - Affinity, placement, and order

Stay informed