Posts by David Limpus
Productionizing TurboQuant on AMD GPUs for KV-Cache-Bound LLM Inference
- 11 June 2026
*The first three authors (Chakrabarti, Limpus, Rana) contributed equally to this work.
*The first three authors (Chakrabarti, Limpus, Rana) contributed equally to this work.