AI Blogs - Page 12

Accelerating models on ROCm using PyTorch TunableOp

A Guide to Implementing and Training Generative Pre-trained Transformers (GPT) in JAX on AMD GPUs

Deep Learning Recommendation Models on AMD GPUs
Deep Learning Recommendation Model on AMD GPU

Mamba on AMD GPUs with ROCm
Best practices for using Mamba on AMD GPUs with ROCm

Fine-tuning and Testing Cutting-Edge Speech Models using ROCm on AMD GPUs
This blog post demonstrates how to fine-tune and test three state-of-the-art Automatic Speech Recognition (ASR) models on AMD GPUs using ROCm.

TensorFlow Profiler in practice: Optimizing TensorFlow models on AMD GPUs
TensorFlow Profiler measures resource use and performance of models, helping identify bottlenecks for optimization. This blog demonstrates the use of the TensorFlow Profiler tool on AMD hardware.

SmoothQuant model inference on AMD Instinct MI300X using Composable Kernel

Unveiling performance insights with PyTorch Profiler on an AMD GPU

Panoptic segmentation and instance segmentation with Detectron2 on AMD GPUs
Object Detection and Image Segmentation with Detectron2 on AMD GPU

Accelerating Large Language Models with Flash Attention on AMD GPUs