Posts tagged Scientific Computing

Modernizing Taichi Lang to LLVM 20 for MI325X GPU Acceleration

04 December 2025

Our first Taichi Lang blog introduced you to Taichi Lang on AMD’s MI210 and MI250X GPUs. That version of Taichi was limited by its dependence on outdated versions of LLVM. We have modernized Taichi to LLVM 20 to take advantage of the latest advances in LLVM’s code generation capabilities. This modernization also makes Taichi available on newer AMD Instinct GPUs, the MI300X and MI325X. As with our previous blog, we provide a guide to understanding Taichi and walk you through installing Taichi and writing and executing a Taichi program.

Read more ...

HPC Coding Agent - Part 1: Combining GLM-powered Cline and RAG Using MCP

03 December 2025

Navigating through extensive High Performance Computing (HPC) documentation can be challenging, especially when working with complex supercomputer environments like LUMI, one of the pan-European pre-exascale supercomputers. Traditional search methods often fall short when you need contextual, actionable answers to technical questions. RAG (Retrieval-Augmented Generation) agents offer a solution by combining large language model reasoning with domain-specific knowledge retrieval to provide accurate, cited responses to your HPC queries.

Read more ...

Accelerating AI-Driven Crystalline Materials Design with MatterGen on AMD Instinct MI300X

21 November 2025

The search for new inorganic materials has always been central to scientific and technological progress. From the silicon that powered the microelectronics revolution to the lithium compounds enabling modern batteries, advances in materials have defined entire eras of innovation. Yet, discovering new compounds with desired properties remains an exceptionally difficult challenge.

Read more ...

Plug-and-Play CuPy on ROCm: Data Analytics Acceleration Made Simple

14 November 2025

AMD is committed to ensuring that CuPy works seamlessly on AMD Instinct GPUs through ROCm and has worked to support the latest features in upstream CuPy on ROCm. In this blog, you will learn about the enhancements in the current and upcoming AMD CuPy releases that will supercharge your analytics and data science projects. In an earlier blog on CuPy and hipDF, it was demonstrated that CuPy and hipDF can be applied to complex analytics tasks with large datasets on ROCm using AMD GPUs. That blog used a PyPI wheel forked from earlier versions of CuPy and cuDF, and both CuPy and ROCm have advanced since then. In the latest AMD CuPy release, you will find many exciting improvements from the upstream CuPy library as well as ROCm 7.

Read more ...

Accelerating Vector Search: hipVS and hipRAFT on AMD

13 November 2025

In this blog, you’ll get an introductory look at hipVS, AMD’s GPU-accelerated vector search library, and its relationship to hipRAFT, a foundational library used by hipVS and other ROCmDS projects. Using an interactive Jupyter notebook, you’ll explore four major vector search methods available in hipVS: Brute-Force KNN, IVF-Flat, IVF-PQ, and CAGRA—each illustrating different trade-offs in accuracy, performance, and memory. You’ll see how to build and query vector search indexes using the hipVS API for applications such as semantic search, recommendation systems, and RAG pipelines. Since the API is compatible with NVIDIA’s cuVS, migrating workflows to AMD hardware is seamless and requires minimal changes.

Read more ...

ROCm 7.0: An AI-Ready Powerhouse for Performance, Efficiency, and Productivity

16 September 2025

Artificial intelligence now defines the performance envelope for modern computation. In this blog, we introduce the AI-centric ROCm 7.0 designed to help our community directly benefit from this dramatic paradigm shift. ROCm 7.0 delivers a platform purpose-built for the era of generative AI, large-scale inference and training, and accelerated discovery, helping you boost the performance, efficiency, and scalability of your workloads.

Read more ...

Accelerating Parallel Programming in Python with Taichi Lang on AMD GPUs

31 July 2025

Taichi Lang is an open-source, imperative, parallel programming language for high-performance numerical computation. It is embedded in Python and uses just-in-time (JIT) compiler frameworks (e.g. LLVM) to offload the compute-intensive Python code to the native GPU or CPU instructions. The language has broad applications spanning real-time physical simulation, numerical computation, augmented reality, artificial intelligence, vision and robotics, visual effects in films and games, general-purpose computing, and much more [1].

Read more ...

AMD ROCm: Powering the World’s Fastest Supercomputers

10 June 2025

From breaking the exaFLOP barrier with Frontier to setting new performance records with El Capitan, AMD is transforming what’s possible in high-performance computing (HPC). But the story goes beyond hardware. At the core of these world-class systems is ROCm, AMD’s open, high-performance software platform enabling new levels of scientific discovery and AI advancement.

Read more ...

Introducing ROCm-DS: GPU-Accelerated Data Science for AMD Instinct™ GPUs

20 May 2025

AMD is excited to announce the early access release of ROCm-DS (ROCm Data Science), a new toolkit designed to accelerate data processing workloads on AMD Instinct™ GPUs. Built on the core ROCm toolkit, ROCm-DS promises to significantly enhance performance and scalability for data-intensive applications, catering to the pressing needs of today’s data-driven landscape. ROCm-DS is based on the open source libraries in the RAPIDS ecosystem. This collection of libraries enables a multitude of data processing operations, allowing new and existing workloads to tap into the computational advantages offered by AMD Instinct Datacenter GPUs. This early access release introduces two powerful new libraries: hipDF and hipGRAPH.

Read more ...

Installing ROCm from source with Spack

14 April 2025

In this guide you will learn how Spack makes building ROCm components from source easier and more flexible than other methods. This blog will walk you through installing ROCm from source using the Spack package manager. We will also discuss Spack’s place among other ROCm installation methods, the landscape of ROCm components, and show you how ROCm, as an open-source software platform, allows developers to streamline software stacks for their applications.

Read more ...

Deep dive into the MI300 compute and memory partition modes

09 February 2025

This blog introduces the inner compute and memory architecture of the AMD Instinct™ MI300, showing you how to use the MI300 GPU’s different partition modes to supercharge performance-critical applications. In this blog, you will first get a brief introduction to the MI300 architecture, explaining how the MI300 compute and memory partitions can be used to your advantage. You will then learn in detail the compute partitioning modes and the memory partitioning modes. Further, two case studies demonstrate and benchmark the performance of the different modes. For convenience, this blog uses the MI300X as a case-in-point example.

Read more ...

Seismic stencil codes - part 3

29 August 2024

12 Aug, 2024

Read more ...

Seismic stencil codes - part 2

29 August 2024

12 Aug, 2024

Read more ...

Seismic stencil codes - part 1

29 August 2024

12 Aug, 2024

Read more ...

Graph analytics on AMD GPUs using Gunrock

29 July 2024

Graphs and graph analytics are related concepts that can help us understand complex data and relationships. In this context, a graph is a mathematical model that represents entities (called nodes or vertices) and their connections (called edges or links). Graph analytics is a form of data analysis that uses graph structures and algorithms to reveal insights from the data.
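As a rough illustration of the node-and-edge model described above, the following minimal Python sketch stores a small graph as an adjacency list and computes two simple analytics: node degree and hop distance via breadth-first search. The graph, the node names, and the helper functions `bfs_distances` and `degrees` are hypothetical and chosen only for illustration; this is plain CPU Python and does not use Gunrock or any GPU library.

```python
from collections import deque

# Hypothetical toy graph: each key is a node, each value lists its neighbors.
# Edges are undirected, so every connection appears in both directions.
graph = {
    "A": ["B", "C"],
    "B": ["A", "D"],
    "C": ["A", "D"],
    "D": ["B", "C", "E"],
    "E": ["D"],
}


def bfs_distances(adjacency, source):
    """Return the number of hops from source to every reachable node."""
    distances = {source: 0}
    queue = deque([source])
    while queue:
        node = queue.popleft()
        for neighbor in adjacency[node]:
            if neighbor not in distances:  # first visit is the shortest path
                distances[neighbor] = distances[node] + 1
                queue.append(neighbor)
    return distances


def degrees(adjacency):
    """Return the number of edges touching each node."""
    return {node: len(neighbors) for node, neighbors in adjacency.items()}


if __name__ == "__main__":
    print(bfs_distances(graph, "A"))
    print(degrees(graph))
```

Breadth-first search visits nodes level by level, which is why the first time a node is reached gives its shortest hop count in an unweighted graph.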

Read more ...

Programming AMD GPUs with Julia

16 April 2024

Julia is a high-level, general-purpose dynamic programming language that automatically compiles to efficient native code via LLVM and supports multiple platforms. With LLVM comes support for programming GPUs, including AMD GPUs.

Read more ...

Sparse matrix vector multiplication - part 1

03 November 2023

Read more ...

Jacobi Solver with HIP and OpenMP offloading

15 September 2023

Read more ...

Finite difference method - Laplacian part 4

18 July 2023

Read more ...

Finite difference method - Laplacian part 3

11 May 2023

Read more ...

Finite difference method - Laplacian part 2

04 January 2023

Read more ...

Finite difference method - Laplacian part 1

14 November 2022

Read more ...