AI - Applications & Models - Page 6

AI - Applications & Models - Page 6#

January 22, 2026

Nitro-AR: A Compact AR Transformer for High-Quality Image Generation

Nitro-AR is a compact E-MMDiT-based masked AR image generator matching diffusion quality with lower latency on AMD GPUs.

./artificial-intelligence/nitro-ar/README.html

January 12, 2026

Athena-PRM: Enhancing Multimodal Reasoning with Data-Efficient Process Reward Models

Learn how to utilize a data-efficient Process Reward Model to enhance the reasoning ability of the Large Language/Multimodal Models.

./artificial-intelligence/amd-elvm/README.html

January 08, 2026

Bridging the Last Mile: Deploying Hummingbird-XT for Efficient Video Generation on AMD Consumer-Grade Platforms

Learn how to use Hummingbird-XT and Hummingbird-XTX modelS to generate videos. Explore the video diffusion model acceleration solution, including dit distillation method and light VAE model.

./artificial-intelligence/hummingbirdxt/README.html

January 07, 2026

High-Resolution Weather Forecasting with StormCast on AMD Instinct GPU Accelerators

A showcase for how to run high-resolution weather prediction models such as StormCast on AMD Instinct hardware.

./artificial-intelligence/stormcast-inference/README.html

January 07, 2026

Breaking the Accuracy-Speed Barrier: How MXFP4/6 Quantization Revolutionizes Image and Video Generation

Explore how MXFP4/6, supported by AMD Instinct™ MI350 series GPUs, achieves BF16-comparable image and video generation quality.

./artificial-intelligence/mxfp-t2i-t2v/README.html

January 06, 2026

ROCm Fork of MaxText: Structure and Strategy

Learn how the ROCm fork of MaxText mirrors upstream while enabling offline testing, minimal datasets, and platform-agnostic, decoupled workflows.

./artificial-intelligence/rocm-fork-of/README.html

January 06, 2026

ROCm MaxText Testing — Decoupled (Offline) and Cloud-Integrated Modes

Learn how to run MaxText unit tests on AMD ROCm GPUs in offline and cloud modes for fast validation, clear reports, and reproducible workflows.

./artificial-intelligence/running-rocm-maxtext/README.html

January 02, 2026

SparK: Query-Aware Unstructured Sparsity with Recoverable KV Cache Channel Pruning

In this blog we will discuss SparK, a training-free, plug-and-play method for KV cache compression in large language models (LLMs).

./artificial-intelligence/spark-blog/README.html

December 23, 2025

GEAK-Triton v2 Family of AI Agents: Kernel Optimization for AMD Instinct GPUs

Introducing GEAK Family - AI-driven agents that automate GPU kernel optimization for AMD Instinct GPUs with hardware-aware feedback

./artificial-intelligence/geak-agents-family/README.html

December 18, 2025

A Step-by-Step Walkthrough of Decentralized LLM Training on AMD GPUs

Learn how to train LLMs across decentralized clusters on AMD Instinct MI300 GPUs with DiLoCo and Prime—scale beyond one datacenter.

./artificial-intelligence/decentralized-training/README.html

December 10, 2025

Medical Imaging on MI300X: SwinUNETR Inference Optimization

A practical guide to optimizing SwinUNETR inference on AMD Instinct™ MI300X GPUs for fast 3D segmentation of tumors in medical imaging.

./artificial-intelligence/swinunetr-inference-optimization/README.html

December 08, 2025

Scaling AI Inference Performance with vLLM on AMD Instinct MI355X GPUs

Explore how MI355X performs against B200 in vLLM benchmarks across DeepSeek-R1, GPT-OSS-120B, Qwen3-235B and Llama-3.3-70B.

./artificial-intelligence/scaling-ai-inference/README.html

Prev Page 6 of 20 Next