Jiahui Cao

Jiahui Cao#

Jiahui is a Sr. Software Engineer at AMD specializing in GPU performance optimization and ML inference systems. His work focuses on kernel optimization on AMD architectures, large-scale LLM inference benchmarking, and end-to-end performance analysis. Jiahui has a solid background in GPU programming, parallel computing, and numerical optimization, with hands-on experience in CUDA/HIP, Triton, and performance profiling tools. He holds a master’s degree in Computer Science and Engineering from the Santa Clara University.

Posts by Jiahui Cao

https://rocm.blogs.amd.com/software-tools-optimization/cdna4-gemm-kernels/README.html