HPC Blogs - Page 3#

Using statistical methods to reliably compare algorithm performance in large generative AI models with JAX Profiler on AMD GPUs
Using Statistical Methods to Reliably Compare Algorithm Performance in Large Generative AI Models with JAX Profiler on AMD GPUs
July 22, 2024 by Douglas Jia

Mamba on AMD GPUs with ROCm
Best practices of using Mamba on AMD GPUs with ROCm
June 28, 2024 by Sean Song, Jassani Adeem, Moskvichev Arseny

TensorFlow Profiler in practice: Optimizing TensorFlow models on AMD GPUs
TensorFlow Profiler measures resource use and performance of models, helping identify bottlenecks for optimization. This blog demonstrates the use of the TensorFlow Profiler tool on AMD hardware.
June 18, 2024 by Fabricio Flores

AMD in Action: Unveiling the Power of Application Tracing and Profiling
AMD in Action: Unveiling the Power of Application Tracing and Profiling
May 07, 2024 by Fabricio Flores

C++17 parallel algorithms and HIPSTDPAR #
C++17 parallel algorithms and HIPSTDPAR
April 18, 2024 by Alessandro Fanfarillo, Alex Voicu

Sparse matrix vector multiplication - part 1
Sparse matrix vector multiplication - Part 1
November 03, 2023 by Paul Mullowney