Posts tagged Kernel

Developing Triton Kernels on AMD GPUs

15 April 2024

OpenAI has developed a powerful GPU focused programming language and compiler called Triton that works seamlessly with AMD GPUs. The goal of Triton is to enable AI engineers and scientists to write high-performant GPU code with minimal expertise. Triton kernels are performant because of their blocked program representation, allowing them to be compiled into highly optimized binary code. Triton also leverages Python for kernel development, making it both familiar and accessible. And the kernels can be easily compiled by simply declaring the triton.jit python decorator before the kernel.

Tags
AI/ML
C++
Compiler
Computer Vision
Generative AI
HPC
Inference
Installation
Julia
Kernel
LLM
Linear Algebra
MONAI
MPI
Memory
Mixed Precision
Mixtral
Mixture of Experts
Multimodal
NUMA
Natural Language Processing
NeRF
Neural Collaborative Filtering
OpenMP
Optimization
Partner Applications
Performance
Profiling
Programming Languages
PyTorch
RAG
ResNet
Scientific computing
Segmentation
Serving
Speech to Text
Stable Diffusion
TensorFlow
Tracing