HPC Blogs - Page 3

HPC Blogs - Page 3#

Using statistical methods to reliably compare algorithm performance in large generative AI models with JAX Profiler on AMD GPUs

Using Statistical Methods to Reliably Compare Algorithm Performance in Large Generative AI Models with JAX Profiler on AMD GPUs

July 22, 2024 by Douglas Jia

Mamba on AMD GPUs with ROCm

Best practices of using Mamba on AMD GPUs with ROCm

June 28, 2024 by Sean Song, Jassani Adeem, Moskvichev Arseny

TensorFlow Profiler in practice: Optimizing TensorFlow models on AMD GPUs

TensorFlow Profiler measures resource use and performance of models, helping identify bottlenecks for optimization. This blog demonstrates the use of the TensorFlow Profiler tool on AMD hardware.

June 18, 2024 by Fabricio Flores

Reading AMD GPU ISA

Reading AMDGCN ISA

May 13, 2024 by Asitav Mishra, Corbin Robeck

AMD in Action: Unveiling the Power of Application Tracing and Profiling

May 07, 2024 by Fabricio Flores

Application portability with HIP

April 26, 2024 by Suyash Tandon, Maria Ruiz Varela, Gina Sitaraman, Bob Robey

C++17 parallel algorithms and HIPSTDPAR #

C++17 parallel algorithms and HIPSTDPAR

April 18, 2024 by Alessandro Fanfarillo, Alex Voicu

Programming AMD GPUs with Julia

April 16, 2024 by Anton Smirnov

Affinity part 1 - Affinity, placement, and order

Affinity Part 1

April 16, 2024 by Gina Sitaraman, Bob Robey, George Markomanolis

Affinity part 2 - System topology and controlling affinity

Affinity Part 2

April 16, 2024 by Gina Sitaraman, Bob Robey, George Markomanolis

Sparse matrix vector multiplication - part 1

Sparse matrix vector multiplication - Part 1

November 03, 2023 by Paul Mullowney

Jacobi Solver with HIP and OpenMP offloading

Finite difference method - Laplacian Part 1

September 15, 2023 by Asitav Mishra, Rajat Arora, Justin Chang

Prev Page 3 of 4 Next