Data Science - Applications & Models

Benchmarking Reasoning Models: From Tokens to Answers

Learn how to benchmark reasoning tasks. Use Qwen3 and vLLM to test true reasoning performance, not just how fast words are generated.

July 24, 2025 by Dominic Widdows

Accelerate DeepSeek-R1 Inference: Integrate AITER into SGLang

Boost DeepSeek-R1 with AITER: Step-by-step SGLang integration for high-performance MoE, GEMM, and attention ops on AMD GPUs.

May 16, 2025 by Bruce Xue, George Wang

Accelerated JPEG decoding on AMD Instinct™ GPUs with rocJPEG

Learn how to decompress JPEG files at breakneck speeds for your AI, vision, and content delivery workloads using rocJPEG and AMD Instinct GPUs.

May 12, 2025 by Marco Grond

Seismic stencil codes - part 2

Seismic Stencil Codes - Part 2: Recall from the previous post that the kernel with stencil computation in the z-direction suffered from low effective bandwidth. This low performance comes from generating substantial amounts of data movement to global memory.

August 29, 2024 by Justin Chang, Ossian O'Reilly

Seismic stencil codes - part 3

Seismic Stencil Codes - Part 3: In the last two blog posts, we developed a HIP kernel capable of computing the high-order finite differences commonly needed in seismic wave propagation.

August 29, 2024 by Justin Chang, Ossian O'Reilly