ROCm™ Blogs


Posts by Aditi Ghai Rana

Productionizing TurboQuant on AMD GPUs for KV-Cache-Bound LLM Inference

  • 11 June 2026
  • Inesh Chakrabarti, David Limpus, Aditi Ghai Rana, Bowen Bao, Spandan Tiwari, Thiago Crepaldi, Ashish Sirasao
  • English
  • Applications & models
  • AI/ML, LLM, Performance, Memory

*The first three authors (Chakrabarti, Limpus, Rana) contributed equally to this work.



© 2025 Advanced Micro Devices, Inc