Recent Posts - Page 7#

March 24, 2026

GROMACS on AMD Instinct GPUs: A Complete Build Guide

Build GROMACS with HIP, UCX, and OpenMPI on AMD MI300X/MI355X — covering bare metal, Apptainer, and Docker deployments.

./artificial-intelligence/gromacs-build-guide/README.html

March 23, 2026

AMD Device Metrics Exporter v1.4.2: Enhanced Observability, Deeper RAS Insights, and Smarter GPU Telemetry for Modern HPC & AI Clusters

Struggling with GPU bottlenecks? Learn how AMD DME v1.4.2 uncovers power, thermal, and RAS issues with actionable, production-ready telemetry.

./software-tools-optimization/device-metrics-exporter/README.html

March 23, 2026

Edge-to-Cloud Robotics with AMD ROCm: From Data Collection to Real-Time Inference

This blog demonstrates a comprehensive Edge-to-Cloud robotics AI solution powered by the AMD ecosystem and the Hugging Face LeRobot framework.

./artificial-intelligence/rocm-blogsblogsartificial-in/README.html

March 19, 2026

hipBLASLt Online GEMM Tuning

Learn how to improve model performance with hipBLASLt online tuning merged into LLM framework

./artificial-intelligence/hipblaslt_online_tuning/README.html

March 19, 2026

Utilizing AMD Instinct GPU Accelerators for Weather and Precipitation Forecasting with NeuralGCM

A showcase of how to run NeuralGCM, a hybrid GCM model, on AMD Instinct hardware, including an introduction, installation, inference, and plotting.

./artificial-intelligence/neuralgcm-inference/README.html

March 18, 2026

Multi-Node Distributed Inference for Diffusion Models with xDiT

Follow a tutorial on multi-node video generation with diffusion models, covering scaling considerations and a practical Docker-based example.

./software-tools-optimization/multinode-hunyuanvideo-xdit/README.html

March 13, 2026

GROMACS Performance on AMD Instinct MI355X

Explore GROMACS molecular dynamics performance benchmarks on AMD Instinct MI355X GPUs with HIP acceleration.

./artificial-intelligence/mi355-gromacs-benchmarks/README.html

March 10, 2026

FP8 GEMM Optimization on AMD CDNA™4 Architecture

Learn how to build high-performance FP8 GEMM kernels on AMD CDNA™4 GPUs using MFMA, LDS swizzling, and double-buffering.

./software-tools-optimization/cdna4-gemm-kernels/README.html

March 09, 2026

Agentic Diagnosis for LLM Training at Scale

Explore how AI agents diagnose LLM training incidents — from RCCL hangs to throughput regressions — in one prompt with MaxText-Slurm.

./software-tools-optimization/maxtext-slurm-agentic-diagnosis/README.html

March 09, 2026

Getting Started with ComfyUI on AMD Radeon™ RX 9000 Series GPUs

Learn how to set up and optimize ComfyUI on AMD Radeon RX 9000 GPUs with ROCm 7.1 — solve common issues and start generating.

./artificial-intelligence/comfyui-radeon-9000/README.html

March 06, 2026

HPC Coding Agent - Part 3: MCP Tool for Profiling

Build an AI agent specialized in optimizing HPC workloads by connecting a Cline agent to expert-level AMD profiling tools via a custom MCP server.

./artificial-intelligence/hpc-agent-profile/README.html

March 06, 2026

Fine-Tuning AI Surrogate Models for Physics Simulations with Walrus on AMD Instinct GPU Accelerators

A showcase of fine-tuning the foundational physics simulation model Walrus on a new physics dataset using AMD Instinct hardware.

./artificial-intelligence/walrus-finetuning/README.html

Prev Page 7 of 35 Next