AI Blogs - Page 2
Týr-the-Pruner: Search-based Global Structural Pruning for LLMs
This blog introduces Týr-the-Pruner, a search-based, end-to-end framework for global structural pruning of large language models (LLMs).
Optimizing LLM Workloads: AMD Instinct MI355X GPUs Drive Competitive Performance
Explore ROCm 7.0’s AI training boost! See how MI355X accelerates JAX and PyTorch frameworks to unlock faster, more efficient LLM scaling.
VLM Fine-Tuning for Robotics on AMD Enterprise AI Suite
Fine-tune OpenCLIP with Bridge Data V2 on ROCm for robotics-related tasks.
Fine-Tune LLMs for Proteins with AMD Enterprise AI Suite
Fine-tune Llama 3.1 8B with ROCm for advanced protein sequence insights in bioinformatics.
Exploring Gameplay Video Generation with Hunyuan-GameCraft
Learn to generate dynamic, action-controllable gameplay videos from single images using Hunyuan-GameCraft on AMD Instinct MI300X GPUs with ROCm.
Using Reinforcement Learning to Fix Text in AI-Generated Videos
Shows how Flow-GRPO can be used to fine-tune Wan models to generate text in videos more accurately, covering background, setup, and examples of training runs.
The vLLM MoE Playbook: A Practical Guide to TP, DP, PP and Expert Parallelism
Learn how to combine TP, DP, PP, and EP for MoE models. Discover proven strategies to maximize performance on your vLLM deployments.
Inference with HunyuanWorld-Voyager on AMD Instinct GPUs
Learn how to run inference with HunyuanWorld-Voyager, a state-of-the-art 3D world model, on AMD Instinct MI300X GPUs using ROCm for efficient video generation and prediction.
Accelerating AI-Driven Crystalline Materials Design with MatterGen on AMD Instinct MI300X
Learn how to run a generative model for inorganic material design with MatterGen on MI300X.
AMD Enterprise AI Suite: Open Infrastructure for Production AI
Explore an open, GPU-optimized platform to build, deploy, and scale enterprise AI workloads on AMD Instinct with production-ready performance.
AMD Inference Microservice (AIM): Production Ready Inference on AMD Instinct™ GPUs
Learn how AIM delivers efficient, scalable inference on AMD Instinct GPUs and see how it simplifies deployment, optimization, and operations.
Plug-and-Play CuPy on ROCm: Data Analytics Acceleration Made Simple
Learn how to enhance your analytics project with the latest AMD CuPy release.