Posts tagged AI/ML
01 May 2024 - Inferencing with Mixtral 8x22B on AMD GPUs
26 April 2024 - Table Question-Answering with TaPas
26 April 2024 - Multimodal (Visual and Language) understanding with LLaVA-NeXT
17 April 2024 - Inferencing with AI2’s OLMo model on AMD GPU
16 April 2024 - Text Summarization with FLAN-T5
16 April 2024 - Speech-to-Text on an AMD GPU with Whisper
16 April 2024 - PyTorch C++ Extension on AMD GPU
16 April 2024 - Programming AMD GPUs with Julia
16 April 2024 - Program Synthesis with CodeGen
15 April 2024 - Enhancing LLM Accessibility: A Deep Dive into QLoRA Through Fine-tuning Llama 2 on a single AMD GPU
15 April 2024 - Developing Triton Kernels on AMD GPUs
11 April 2024 - GPU Unleashed: Training Reinforcement Learning Agents with Stable Baselines3 on an AMD GPU in Gymnasium Environment
09 April 2024 - ResNet for image classification using AMD GPUs
08 April 2024 - Small language models with Phi-2
04 April 2024 - Using the ChatGLM-6B bilingual language model with AMD GPUs
04 April 2024 - Total body segmentation using MONAI Deploy on an AMD GPU
04 April 2024 - Retrieval Augmented Generation (RAG) using LlamaIndex
04 April 2024 - Inferencing and serving with vLLM on AMD GPUs
04 April 2024 - Image classification using Vision Transformer with AMD GPUs
04 April 2024 - Building semantic search with SentenceTransformers on AMD
01 April 2024 - Scale AI applications with Ray
15 March 2024 - Large language model inference optimizations on AMD GPUs
12 March 2024 - Building a decoder transformer model on AMD GPU(s)
11 March 2024 - Question-answering Chatbot with LangChain on an AMD GPU
08 March 2024 - Music Generation With MusicGen on an AMD GPU
23 February 2024 - Efficient image generation with Stable Diffusion models and ONNX Runtime using AMD GPUs
08 February 2024 - Simplifying deep learning: A guide to PyTorch Lightning
07 February 2024 - Two-dimensional images to three-dimensional scene mapping using NeRF on an AMD GPU
05 February 2024 - Using LoRA for efficient fine-tuning: Fundamental principles
01 February 2024 - Fine-tune Llama 2 with LoRA: Customizing a large language model for question-answering
29 January 2024 - Pre-training BERT using Hugging Face & TensorFlow on an AMD GPU
26 January 2024 - Pre-training BERT using Hugging Face & PyTorch on an AMD GPU
26 January 2024 - Accelerating XGBoost with Dask using multiple AMD GPUs
25 January 2024 - LLM distributed supervised fine-tuning with JAX
24 January 2024 - Efficient image generation with Stable Diffusion models and AITemplate using AMD GPUs
24 January 2024 - Efficient deployment of large language models with Text Generation Inference on AMD GPUs
11 September 2023 - Creating a PyTorch/TensorFlow code environment on AMD GPUs
Posts tagged C++
18 April 2024 - C++17 parallel algorithms and HIPSTDPAR
16 April 2024 - PyTorch C++ Extension on AMD GPU
Posts tagged Compiler
13 May 2024 - Reading AMDGCN ISA
26 April 2024 - Application portability with HIP
18 April 2024 - C++17 parallel algorithms and HIPSTDPAR
14 November 2022 - AMD matrix cores
Posts tagged Computer Vision
09 April 2024 - ResNet for image classification using AMD GPUs
04 April 2024 - Total body segmentation using MONAI Deploy on an AMD GPU
04 April 2024 - Image classification using Vision Transformer with AMD GPUs
Posts tagged Generative AI
17 April 2024 - Inferencing with AI2’s OLMo model on AMD GPU
16 April 2024 - Program Synthesis with CodeGen
15 April 2024 - Enhancing LLM Accessibility: A Deep Dive into QLoRA Through Fine-tuning Llama 2 on a single AMD GPU
04 April 2024 - Image classification using Vision Transformer with AMD GPUs
04 April 2024 - Building semantic search with SentenceTransformers on AMD
01 April 2024 - Scale AI applications with Ray
15 March 2024 - Large language model inference optimizations on AMD GPUs
08 March 2024 - Music Generation With MusicGen on an AMD GPU
23 February 2024 - Efficient image generation with Stable Diffusion models and ONNX Runtime using AMD GPUs
07 February 2024 - Two-dimensional images to three-dimensional scene mapping using NeRF on an AMD GPU
05 February 2024 - Using LoRA for efficient fine-tuning: Fundamental principles
01 February 2024 - Fine-tune Llama 2 with LoRA: Customizing a large language model for question-answering
29 January 2024 - Pre-training BERT using Hugging Face & TensorFlow on an AMD GPU
26 January 2024 - Pre-training BERT using Hugging Face & PyTorch on an AMD GPU
25 January 2024 - LLM distributed supervised fine-tuning with JAX
Posts tagged HPC
13 May 2024 - Reading AMDGCN ISA
26 April 2024 - Application portability with HIP
18 April 2024 - C++17 parallel algorithms and HIPSTDPAR
16 April 2024 - Programming AMD GPUs with Julia
16 April 2024 - Affinity part 2 - System topology and controlling affinity
16 April 2024 - Affinity part 1 - Affinity, placement, and order
03 November 2023 - Sparse matrix vector multiplication - part 1
15 September 2023 - Jacobi Solver with HIP and OpenMP offloading
18 July 2023 - Finite difference method - Laplacian part 4
08 June 2023 - GPU-aware MPI with ROCm
17 May 2023 - Register pressure in AMD CDNA™2 GPUs
11 May 2023 - Finite difference method - Laplacian part 3
26 January 2023 - AMD ROCm™ installation
04 January 2023 - Finite difference method - Laplacian part 2
14 November 2022 - Finite difference method - Laplacian part 1
14 November 2022 - AMD matrix cores
Posts tagged Inference
04 April 2024 - Inferencing and serving with vLLM on AMD GPUs
Posts tagged Installation
26 April 2024 - Application portability with HIP
Posts tagged Julia
16 April 2024 - Programming AMD GPUs with Julia
Posts tagged Kernel
15 April 2024 - Developing Triton Kernels on AMD GPUs
Posts tagged LLM
01 May 2024 - Step-by-Step Guide to Use OpenLLM on AMD GPUs
15 April 2024 - Enhancing LLM Accessibility: A Deep Dive into QLoRA Through Fine-tuning Llama 2 on a single AMD GPU
04 April 2024 - Using the ChatGLM-6B bilingual language model with AMD GPUs
04 April 2024 - Retrieval Augmented Generation (RAG) using LlamaIndex
04 April 2024 - Inferencing and serving with vLLM on AMD GPUs
04 April 2024 - Building semantic search with SentenceTransformers on AMD
01 April 2024 - Scale AI applications with Ray
15 March 2024 - Large language model inference optimizations on AMD GPUs
05 February 2024 - Using LoRA for efficient fine-tuning: Fundamental principles
01 February 2024 - Fine-tune Llama 2 with LoRA: Customizing a large language model for question-answering
29 January 2024 - Pre-training BERT using Hugging Face & TensorFlow on an AMD GPU
26 January 2024 - Pre-training BERT using Hugging Face & PyTorch on an AMD GPU
26 January 2024 - Accelerating XGBoost with Dask using multiple AMD GPUs
25 January 2024 - LLM distributed supervised fine-tuning with JAX
Posts tagged Linear Algebra
14 November 2022 - AMD matrix cores
Posts tagged MONAI
04 April 2024 - Total body segmentation using MONAI Deploy on an AMD GPU
Posts tagged MPI
16 April 2024 - Affinity part 2 - System topology and controlling affinity
16 April 2024 - Affinity part 1 - Affinity, placement, and order
08 June 2023 - GPU-aware MPI with ROCm
Posts tagged Memory
13 May 2024 - Reading AMDGCN ISA
18 April 2024 - C++17 parallel algorithms and HIPSTDPAR
16 April 2024 - Affinity part 2 - System topology and controlling affinity
16 April 2024 - Affinity part 1 - Affinity, placement, and order
03 November 2023 - Sparse matrix vector multiplication - part 1
18 July 2023 - Finite difference method - Laplacian part 4
17 May 2023 - Register pressure in AMD CDNA™2 GPUs
11 May 2023 - Finite difference method - Laplacian part 3
12 April 2023 - Introduction to profiling tools for AMD hardware
09 March 2023 - AMD Instinct™ MI200 GPU memory space overview
04 January 2023 - Finite difference method - Laplacian part 2
14 November 2022 - AMD matrix cores
Posts tagged Mixed Precision
29 March 2024 - Automatic mixed precision in PyTorch using AMD GPUs
Posts tagged Mixtral
01 May 2024 - Inferencing with Mixtral 8x22B on AMD GPUs
Posts tagged Mixture of Experts
01 May 2024 - Inferencing with Mixtral 8x22B on AMD GPUs
Posts tagged Multimodal
Posts tagged NUMA
16 April 2024 - Affinity part 2 - System topology and controlling affinity
16 April 2024 - Affinity part 1 - Affinity, placement, and order
Posts tagged Natural Language Processing
29 January 2024 - Pre-training BERT using Hugging Face & TensorFlow on an AMD GPU
26 January 2024 - Pre-training BERT using Hugging Face & PyTorch on an AMD GPU
25 January 2024 - LLM distributed supervised fine-tuning with JAX
Posts tagged NeRF
Posts tagged Neural Collaborative Filtering
Posts tagged OpenMP
16 April 2024 - Affinity part 2 - System topology and controlling affinity
16 April 2024 - Affinity part 1 - Affinity, placement, and order
Posts tagged Optimization
26 April 2024 - Application portability with HIP
14 November 2022 - AMD matrix cores
Posts tagged Partner Applications
Posts tagged Performance
18 April 2024 - C++17 parallel algorithms and HIPSTDPAR
16 April 2024 - Affinity part 2 - System topology and controlling affinity
16 April 2024 - Affinity part 1 - Affinity, placement, and order
Posts tagged Profiling
12 April 2023 - Introduction to profiling tools for AMD hardware
Posts tagged Programming Languages
18 April 2024 - C++17 parallel algorithms and HIPSTDPAR
Posts tagged PyTorch
26 April 2024 - Table Question-Answering with TaPas
26 April 2024 - Multimodal (Visual and Language) understanding with LLaVA-NeXT
17 April 2024 - Inferencing with AI2’s OLMo model on AMD GPU
16 April 2024 - Text Summarization with FLAN-T5
16 April 2024 - PyTorch C++ Extension on AMD GPU
16 April 2024 - Program Synthesis with CodeGen
11 April 2024 - GPU Unleashed: Training Reinforcement Learning Agents with Stable Baselines3 on an AMD GPU in Gymnasium Environment
09 April 2024 - ResNet for image classification using AMD GPUs
08 April 2024 - Small language models with Phi-2
04 April 2024 - Using the ChatGLM-6B bilingual language model with AMD GPUs
04 April 2024 - Total body segmentation using MONAI Deploy on an AMD GPU
29 March 2024 - Automatic mixed precision in PyTorch using AMD GPUs
12 March 2024 - Building a decoder transformer model on AMD GPU(s)
11 March 2024 - Question-answering Chatbot with LangChain on an AMD GPU
08 March 2024 - Music Generation With MusicGen on an AMD GPU
23 February 2024 - Efficient image generation with Stable Diffusion models and ONNX Runtime using AMD GPUs
08 February 2024 - Simplifying deep learning: A guide to PyTorch Lightning
07 February 2024 - Two-dimensional images to three-dimensional scene mapping using NeRF on an AMD GPU
05 February 2024 - Using LoRA for efficient fine-tuning: Fundamental principles
26 January 2024 - Pre-training BERT using Hugging Face & PyTorch on an AMD GPU
11 September 2023 - Creating a PyTorch/TensorFlow code environment on AMD GPUs
Posts tagged RAG
04 April 2024 - Retrieval Augmented Generation (RAG) using LlamaIndex
Posts tagged ResNet
09 April 2024 - ResNet for image classification using AMD GPUs
Posts tagged Scientific computing
26 January 2024 - Accelerating XGBoost with Dask using multiple AMD GPUs
03 November 2023 - Sparse matrix vector multiplication - part 1
15 September 2023 - Jacobi Solver with HIP and OpenMP offloading
18 July 2023 - Finite difference method - Laplacian part 4
08 June 2023 - GPU-aware MPI with ROCm
11 May 2023 - Finite difference method - Laplacian part 3
04 January 2023 - Finite difference method - Laplacian part 2
14 November 2022 - Finite difference method - Laplacian part 1
Posts tagged Segmentation
04 April 2024 - Total body segmentation using MONAI Deploy on an AMD GPU
Posts tagged Serving
04 April 2024 - Inferencing and serving with vLLM on AMD GPUs
Posts tagged Speech to Text
16 April 2024 - Speech-to-Text on an AMD GPU with Whisper
Posts tagged Stable Diffusion
17 April 2024 - Inferencing with AI2’s OLMo model on AMD GPU
01 April 2024 - Scale AI applications with Ray
Posts tagged TensorFlow
11 September 2023 - Creating a PyTorch/TensorFlow code environment on AMD GPUs
Posts tagged Tracing
Posts tagged Triton
15 April 2024 - Developing Triton Kernels on AMD GPUs
Posts tagged Tuning
26 April 2024 - Table Question-Answering with TaPas
26 April 2024 - Multimodal (Visual and Language) understanding with LLaVA-NeXT
16 April 2024 - Text Summarization with FLAN-T5
16 April 2024 - Affinity part 2 - System topology and controlling affinity
16 April 2024 - Affinity part 1 - Affinity, placement, and order
15 April 2024 - Enhancing LLM Accessibility: A Deep Dive into QLoRA Through Fine-tuning Llama 2 on a single AMD GPU
08 April 2024 - Small language models with Phi-2
01 April 2024 - Scale AI applications with Ray
15 March 2024 - Large language model inference optimizations on AMD GPUs
12 March 2024 - Building a decoder transformer model on AMD GPU(s)
11 March 2024 - Question-answering Chatbot with LangChain on an AMD GPU
08 February 2024 - Simplifying deep learning: A guide to PyTorch Lightning
05 February 2024 - Using LoRA for efficient fine-tuning: Fundamental principles
01 February 2024 - Fine-tune Llama 2 with LoRA: Customizing a large language model for question-answering
29 January 2024 - Pre-training BERT using Hugging Face & TensorFlow on an AMD GPU
26 January 2024 - Pre-training BERT using Hugging Face & PyTorch on an AMD GPU
25 January 2024 - LLM distributed supervised fine-tuning with JAX
Posts tagged Whisper
16 April 2024 - Speech-to-Text on an AMD GPU with Whisper