Xuanwu Yin

Xuanwu Yin#

Xuanwu Yin leads the model optimization team, driving work on model quantization, sparsity, speculative decoding, and efficient training/inference across multiple platforms. His team delivers high-performance, production-ready solutions for large language models, vision-language models, and image/video-generation pipelines, while providing direct support to customers.

Posts by Xuanwu Yin

https://rocm.blogs.amd.com/artificial-intelligence/amd-elvm/README.html
https://rocm.blogs.amd.com/artificial-intelligence/mxfp-t2i-t2v/README.html
https://rocm.blogs.amd.com/artificial-intelligence/spark-blog/README.html
https://rocm.blogs.amd.com/artificial-intelligence/tyr-the-pruner/README.html
https://rocm.blogs.amd.com/software-tools-optimization/gumiho/README.html
https://rocm.blogs.amd.com/artificial-intelligence/mlperf-inference-v5.1/README.html
https://rocm.blogs.amd.com/artificial-intelligence/mlperf-llama-pruning/README.html
https://rocm.blogs.amd.com/artificial-intelligence/mlperf-inference5.1-repro/README.html
https://rocm.blogs.amd.com/artificial-intelligence/elvm,-vlms,-llm,/README.html