Applications & models - Page 2

Applications & models - Page 2#

Explore the latest blogs about applications and models in the ROCm ecosystem, including machine learning frameworks, AI models, and application case studies.

Empowering Developers to Build a Robust PyTorch Ecosystem on AMD ROCm™ with Better Insights and Monitoring

Production ROCm support for N-1 to N+1 PyTorch releases is in progress. The AI Software Head-Up Dashboard shows status of PyTorch on ROCm.

October 21, 2025 by Hongxia Yang, Peng Sun, Nick Romero, Jeff Daily, Jithun Nair, Pruthvi Madugundu, Jagadish Krishnamoorthy, Srinivasan Subramanian, Eli Uriegas

Kimi-K2-Instruct: Enhanced Out-of-the-Box Performance on AMD Instinct MI355 Series GPUs

Learn how AMD Instinct MI355 Series GPUs deliver competitive Kimi-K2 inference with faster TTFT, lower latency, and strong throughput.

October 16, 2025 by Wei Cai, Fan Wu, George Wang

Announcing MONAI 1.0.0 for AMD ROCm: Breakthrough AI Acceleration for Medical Imaging Models on AMD Instinct™ GPUs

Learn how to use Medical Open Network for Artificial Intelligence (MONAI) 1.0 on ROCm, with examples and demonstrations.

October 07, 2025 by Soumitra Chatterjee, Anik Chaudhuri, Chandan Sharma, Vikas C Sajjan, Vish Vadlamani, Dominic Widdows, Phani Vaddadi

Medical Imaging on MI300X: Optimized SwinUNETR for Tumor Detection

Learn how to setup, run and optimize SwinUNETR on AMD MI300X GPUs for fast medical imaging 3D segmentation of tumors using fast, large ROIs.

October 07, 2025 by Joaquin Rives Gambin, Vasumathi Neralla, David Björelind, Rui Sampaio

Optimizing FP4 Mixed-Precision Inference with Petit on AMD Instinct MI250 and MI300 GPUs: A Developer’s Perspective

Learn how FP4 mixed-precision on AMD GPUs boosts inference speed and integrates seamlessly with SGLang.

October 06, 2025 by Haohui Mai, Charles Yang, George Wang

Optimizing Drug Discovery Tools on AMD MI300X Part 2: 3D Molecular Generation with SemlaFlow

Learn how to set up, run, and optimize SemlaFlow, a molecular generation tool, on AMD MI300X GPUs for faster drug discovery workflows

October 03, 2025 by Vasumathi Neralla, Rui Sampaio

From Ingestion to Inference: RAG Pipelines on AMD GPUs

Build a RAG enhanced GenAI application that improves the quality of model responses by incorporating data that is missing in the model training data.

October 02, 2025 by Lin Sun, Anuya Welling, Fabricio Flores, Eliot Li, Yao Liu, Phani Vaddadi, Vish Vadlamani

Enabling FlashInfer on ROCm for Accelerated LLM Serving

FlashInfer is an open-source library for accelerating LLM serving that is now supported by ROCm.

October 01, 2025 by Rishi Madduri, Diptorup Deb, Debasis Mandal, Clint Greene, Mukhil Azhagan Mallaiyan Sathiaseelan, Yao Liu, Phani Vaddadi, Vish Vadlamani

Coding Agents on AMD GPUs: Fast LLM Pipelines for Developers

Accelerate AI-assisted coding with agentic workflows on AMD GPUs. Deploy DeepSeek-V3.1 via SGLang, vLLM, or llama.cpp to power fast, scalable coding agents

September 30, 2025 by Lin Sun, Yao Liu, Phani Vaddadi, Vish Vadlamani

Day-0 Support for the SGLang-Native RL Framework - slime on AMD Instinct™ GPUs

Learn how to deploy slime on AMD GPUs for high-performance RL training with ROCm optimization

September 25, 2025 by Yusheng Su, Yuzhen Zhou, Jin Pan, Gowtham Ramesh, Xiaodong Yu, Jialian Wu, Ze Wang, Ximeng Sun, Jiang Liu, Hao Chen, Zicheng Liu, Emad Barsoum

Accelerating Audio-Driven Video Generation: WAN2.2-S2V on AMD ROCm

This blog will highlight AMD ROCm’s ability to power next-generation audio-to-video models with simple, reproducible workflows.

September 24, 2025 by Johanna Yang, Kristoffer Peyron

A Simple Design for Serving Video Generation Models with Distributed Inference

Minimalist FastAPI + Redis + Torchrun design for serving video generation models with distributed inference.

September 24, 2025 by Sopiko Kurdadze, Albin Toft

Prev Page 2 of 15 Next