Recent Posts#

Empowering Developers to Build a Robust PyTorch Ecosystem on AMD ROCm™ with Better Insights and Monitoring

At AMD, the PyTorch ecosystem team is committed to delivering an exceptional out-of-the-box experience for developers. Over the past year, the team has made significant progress in expanding PyTorch ecosystem support, improving CI test coverage across a wider range of GPU architectures, enhancing training and inference capabilities, streamlining the developer experience, introducing new functionality and performance optimizations, and strengthening quality monitoring. This blog showcases our ongoing efforts to build a robust PyTorch ecosystem on AMD ROCm™ Software, including the production-readiness of PyTorch across N-1, N, and N+1 releases aligned with ROCm versions. We also introduce the AI SoftWare Heads-Up Dashboard (AISWHUD), a powerful new tool that provides deep insights into the health and performance of the PyTorch ecosystem on ROCm, empowering developers with greater visibility and control.

October 21, 2025 by Hongxia Yang, Peng Sun, Nick Romero, Jeff Daily, Jithun Nair, Pruthvi Madugundu, Jagadish Krishnamoorthy, Srinivasan Subramanian, Eli Uriegas

Prev Page 1 of 21 Next

Recent Posts

Recent Posts#

Empowering Developers to Build a Robust PyTorch Ecosystem on AMD ROCm™ with Better Insights and Monitoring

ROCm 7.9 Technology Preview: ROCm Core SDK and TheRock Build System

Kimi-K2-Instruct: Enhanced Out-of-the-Box Performance on AMD Instinct MI355 Series GPUs

Gumiho: A New Paradigm for Speculative Decoding — Earlier Tokens in a Draft Sequence Matter More

GEMM Tuning within hipBLASLt– Part 2

Announcing MONAI 1.0.0 for AMD ROCm: Breakthrough AI Acceleration for Medical Imaging Models on AMD Instinct™ GPUs

Medical Imaging on MI300X: Optimized SwinUNETR for Tumor Detection

Optimizing FP4 Mixed-Precision Inference with Petit on AMD Instinct MI250 and MI300 GPUs: A Developer’s Perspective

Optimizing Drug Discovery Tools on AMD MI300s Part 2: 3D Molecular Generation with SemlaFlow

Elevating 3D Scene Rendering with GSplat

From Ingestion to Inference: RAG Pipelines on AMD GPUs

Enabling FlashInfer on ROCm for Accelerated LLM Serving