Eliot Li

Eliot Li#

Eliot is a Senior Data Science Manager at AMD. He is passionate about advancing the state of the art in machine learning. He has over two decades of experience in building products that creates enormous amount of value for customers in large scale marketplaces using machine learning and market design principles. Eliot holds a PhD in Electrical Engineering from Yale University, and a BA from Oxford University.

Posts by Eliot Li

Accelerating llama.cpp on AMD Instinct MI300X

Learn more about the superior performance of llama.cpp on Instinct platforms.

December 11, 2025 by Pei Zhang, Deepan Sekar, Eliot Li, Yao Liu, Phani Vaddadi, Vish Vadlamani

DGL in Depth: SE(3)-Transformer on ROCm 7

Inform the AI community about running SE(3)-Transformer with DGL on AMD Instinct platforms.

December 05, 2025 by Anuya Welling, James E. T. Smith, Geoffrey C. Martin-Noble, Tres Popp, Eliot Li, Mukhil Azhagan Mallaiyan Sathiaseelan, Yao Liu, Phani Vaddadi, Vish Vadlamani

Plug-and-Play CuPy on ROCm: Data Analytics Acceleration Made Simple

Learn about how to enhance your analytics project with the latest AMD CuPy release.

November 14, 2025 by Grant Pinkert, Eliot Li

Accelerating Vector Search: hipVS and hipRAFT on AMD

Learn how hipVS accelerates vector search on AMD Instinct GPUs, with notebook demos for semantic search, RAG, and recommendation systems.

November 13, 2025 by Sukriti Choudhary, Sujin Philip, Kevin Joseph, Fabricio Flores, Eliot Li, Lalith Narasimhan, Phani Vaddadi, Vish Vadlamani

Reproducing AMD MLPerf Training v5.1 Submission Result

Learn how to reproduce AMD's MLPerf Training v5.1 submission result.

November 12, 2025 by Meena Arunachalam, Miro Hodak, Ravi Dwivedula, Sarthak Arora, Sathish Sanjeevi, Su Ann Chong, Karan Verma, Eliot Li

Technical Dive into AMD MLPerf Training v5.1 Submission

Learn about the technical details of how AMD achieved the results in the MLPerf Training v5.1 submission.

November 12, 2025 by Meena Arunachalam, Miro Hodak, Ravi Dwivedula, Sarthak Arora, Sathish Sanjeevi, Su Ann Chong, Karan Verma, Eliot Li

From Ingestion to Inference: RAG Pipelines on AMD GPUs

Build a RAG enhanced GenAI application that improves the quality of model responses by incorporating data that is missing in the model training data.

October 02, 2025 by Lin Sun, Anuya Welling, Fabricio Flores, Eliot Li, Yao Liu, Phani Vaddadi, Vish Vadlamani

Technical Dive into AMD's MLPerf Inference v5.1 Submission

In this blog, we share the technical details of how we accomplish the results in our MLPerf Inference v5.1 submission.

September 09, 2025 by Meena Arunachalam, Miro Hodak, Poovaiah Palangappa, Wei-Ting Liao, Uma Kannikanti, Fulu Li, Neha Mathews, Rajesh Poornachandran, Ean Garvey, Kumar Deepak, Yixing Xu, Zhe Li, Guanchen Li, Xuanwu Yin, Dong Li, Zhao Lin, Wei Luo, Bowen Bao, Spandan Tiwari, Niels Zhang, Vinayak Gokhale, Clint Greene, Eliot Li

Slim Down Your Llama: Pruning & Fine-Tuning for Maximum Performance

This blog describes the technical details of how we prune and fine tune the Llama 3.1 405B model in our MLPerf Inference v5.1 submission.

September 09, 2025 by Meena Arunachalam, Miro Hodak, Poovaiah Palangappa, Fulu Li, Yixing Xu, Zhe Li, Guanchen Li, Xuanwu Yin, Dong Li, Karan Verma, Clint Greene, Eliot Li

Reproducing the AMD Instinct™ GPUs MLPerf Inference v5.1 Submission

In this blog, we will provide step by step instruction on how to reproduce AMD's MLPerf Inference v5.1 Submission

September 09, 2025 by Meena Arunachalam, Miro Hodak, Poovaiah Palangappa, Wei-Ting Liao, Uma Kannikanti, Fulu Li, Karan Verma, Neha Mathews, Yamini Kamisetty, Chelsea Iluno, Ean Garvey, Kumar Deepak, Yixing Xu, Zhe Li, Guanchen Li, Xuanwu Yin, Dong Li, Clint Greene, Eliot Li

Llama.cpp Meets Instinct: A New Era of Open-Source AI Acceleration

performance optimizations for llama.cpp on AMD Instinct GPUs

September 09, 2025 by Deepan Sekar, Pei Zhang, Eliot Li, Yao Liu, Phani Vaddadi, Vish Vadlamani

Reproduce AMD's MLPerf Training v5.0 Submission Result with Instinct™ GPUs

Follow this step-by-step guide to reproduce AMDs MLPerf 5.0 Training Submission with Instinct GPUs using ROCm

June 04, 2025 by Meena Arunachalam, Miro Hodak, Ravi Dwivedula, Su Ann Chong, Sarthak Arora, Sathish Sanjeevi, Karan Verma, Eliot Li

AMD’s MLPerf Training Debut: Optimizing LLM Fine-Tuning with Instinct™ GPUs

Explore the techniques we used to improve the training performance on MI300X and MI325X in our MLPerf Training 5.0 submission.

June 04, 2025 by Meena Arunachalam, Miro Hodak, Ravi Dwivedula, Sarthak Arora, Sathish Sanjeevi, Su Ann Chong, Karan Verma, Eliot Li

High-Throughput BERT-L Pre-Training on AMD Instinct™ GPUs: A Practical Guide

Learn how to optimize BERT-L training with mixed precision and Flash Attention v2 on AMD Instinct GPUs — follow our tested MLPerf-compliant step-by-step guide.

June 03, 2025 by Meena Arunachalam, Miro Hodak, Ravi Dwivedula, Su Ann Chong, Sarthak Arora, Sathish Sanjeevi, Karan Verma, Eliot Li

Scale LLM Inference with Multi-Node Infrastructure

Learn how to horizontally scale LLM inference using open-source tools on MI300X, with vLLM, nginx, Prometheus, and Grafana.

May 30, 2025 by Jorge Parada, Eliot Li

AMD Instinct™ MI325X GPUs Produce Strong Performance in MLPerf Inference v5.0

We showcase MI325X GPU optimizations that power our MLPerf v5.0 results on Llama 2 70B, highlighting performance tuning, quantization, and vLLM advancements.

April 02, 2025 by Meena Arunachalam, Miro Hodak, Wei-Ting Liao, Poovaiah Palangappa, Eliot Li, AMD Quark Team, AMD Brevitas Team, and AMD Shark Team

Reproducing the AMD Instinct™ GPUs MLPerf Inference v5.0 Submission

A step-by-step guide to reproducing AMD’s MLPerf v5.0 results for Llama 2 70B & SDXL using ROCm on MI325X

April 02, 2025 by Meena Arunachalam, Miro Hodak, Wei-Ting Liao, Karan Verma, Ean Garvey, Kumar Deepak, Giuseppe Franco, Eliot Li, AMD Quark team

Triton Inference Server with vLLM on AMD GPUs

This blog provides a how-to guide on setting up a Triton Inference Server with vLLM backend powered by AMD GPUs, showcasing robust performance with several LLMs

January 08, 2025 by Fabricio Flores, Tiffany Mintz, Eliot Li, Yao Liu, Ted Themistokleous, Brian Pickrell, Vish Vadlamani

Benchmarking Machine Learning using ROCm and AMD GPUs: Reproducing Our MLPerf Inference Submission

August 28, 2024 by Meena Arunachalam, Miro Hodak, Jeremy Arnold, Eliot Li

Performing natural language processing tasks with LLMs on ROCm running on AMD GPUs

August 21, 2024 by Eliot Li

Inferencing with Grok-1 on AMD GPUs

We demonstrate that the massive Grok-1 Model from xAI can run seamlessly on the AMD MI300X GPU accelerator by leveraging the ROCm software platform.

August 09, 2024 by Eliot Li, Luise Chen, Lei Shao

Image classification using Vision Transformer with AMD GPUs

April 04, 2024 by Eliot Li

Scale AI applications with Ray

April 01, 2024 by Vicky Tsang, Logan Grado, Eliot Li