Recent Posts - Page 10#

Introducing Instella-Math: Fully Open Language Model with Reasoning Capability

Instella-Math is AMD’s 3B reasoning model, trained on 32 MI300X GPUs with open weights, optimized for logic, math, and chain-of-thought tasks.

August 09, 2025 by Xiaodong Yu, Jiang Liu, Yusheng Su, Gowtham Ramesh, Zicheng Liu, Prakamya Mishra, Sudhanshu Ranjan, Jialian Wu, Ximeng Sun, Ze Wang, Emad Barsoum

Running ComfyUI in Windows with ROCm on WSL

Run ComfyUI on Windows with ROCm and WSL to harness Radeon GPU power for local AI tasks like Stable Diffusion—no dual-boot needed

August 07, 2025 by Warren Eng

Day 0 Developer Guide: Running the Latest Open Models from OpenAI on AMD AI Hardware

Day 0 support across our AI hardware ecosystem from our flagship AMD InstinctTM MI355X and MI300X GPUs, AMD Radeon™ AI PRO R700 GPUs and AMD Ryzen™ AI Processors

August 05, 2025 by Andy Luo, Shekhar Pandey, Hongxia Yang, Mahdi Ghodsi, Charles Yang, Niles Burbank, George Wang, Kailash Gogineni, Xun Wang, Zhenyu Gu, Yao Fu, Yanyuan Qin, Anshul Gupta

AMD Hummingbird Image to Video: A Lightweight Feedback-Driven Model for Efficient Image-to-Video Generation

We present AMD Hummingbird, offering a two-stage distillation framework for efficient, high-quality text-to-video generation using compact models.

August 03, 2025 by Takashi Isobe, Dong zhou, He Cui, Mengmeng Ge, Dong Li, Emad Barsoum

GEAK: Introducing Triton Kernel AI Agent & Evaluation Benchmarks

AMD introduces GEAK, an AI agent for generating optimized Triton GPU kernels, achieving up to 63% accuracy and up to 2.59× speedups on MI300X GPUs.

August 01, 2025 by Jianghui Wang, Vinay Joshi, Saptarshi Majumder, Chao Xu, Bin Ding, Ziqiong Liu, Pratik Prabhanjan Brahma, Dong Li, Zicheng Liu, Emad Barsoum

Accelerating Parallel Programming in Python with Taichi Lang on AMD GPUs

This blog provides a how-to guide on installing and programming with Taichi Lang on AMD Instinct GPUs.

July 31, 2025 by Tiffany Mintz, Yao Liu, Phani Vaddadi, Vish Vadlamani

Graph Neural Networks at Scale: DGL with ROCm on AMD Hardware

Accelerate Graph Deep Learning on AMD GPUs with DGL and ROCm—scale efficiently with open tools and optimized performance.

July 31, 2025 by Mukhil Azhagan Mallaiyan Sathiaseelan, Anuya Welling, Yao Liu, Phani Vaddadi, Vish Vadlamani

Avoiding LDS Bank Conflicts on AMD GPUs Using CK-Tile Framework

This blog shows how CK-Tile’s XOR-based swizzle optimizes shared memory access in GEMM kernels on AMD GPUs by eliminating LDS bank conflicts

July 25, 2025 by Haocong Wang, Clement Lin, Menghsuan Yang, Yuchen Lin, Bobo Fang, Chunhung Wang, David Li, George Wang, Anshul Gupta

Benchmarking Reasoning Models: From Tokens to Answers

Learn how to benchmark reasoning tasks. Use Qwen3 and vLLM to test true reasoning performance, not just how fast words are generated.

July 24, 2025 by Dominic Widdows

Chain-of-Thought Guided Visual Reasoning Using Llama 3.2 on a Single AMD Instinct MI300X GPU

Fine-tune Llama 3.2 Vision models on AMD MI300X GPU using Torchtune, achieving 2.3× better accuracy with 11B vs 90B model on chart-based tasks.

July 21, 2025 by Matthias Reso

Introducing ROCm-LS: Accelerating Life Science Workloads with AMD Instinct™ GPUs

Accelerate life science and medical workloads with ROCm-LS, AMDs GPU-optimized toolkit for faster multidimensional image processing and vision.

July 18, 2025 by Soumitra Chatterjee, Karthik Kashyap Thatipamula, Deeksha Goplani, Ish Kool, Anik Chaudhuri, Vikas C Sajjan, Marco Grond

Announcing hipCIM: A Cutting-Edge Solution for Accelerated Multidimensional Image Processing

Fully utilize the power of AMDs Instinct GPUs to process and interpret detailed multidimensional images with lightning speed.

July 18, 2025 by Soumitra Chatterjee, Karthik Kashyap Thatipamula, Deeksha Goplani, Ish Kool, Anik Chaudhuri, Vikas C Sajjan, Marco Grond

Prev Page 10 of 27 Next