Jiahui Cao
Jiahui is a Sr. Software Engineer at AMD specializing in GPU performance optimization and ML inference systems. His work focuses on kernel optimization on AMD architectures, large-scale LLM inference benchmarking, and end-to-end performance analysis. Jiahui has a solid background in GPU programming, parallel computing, and numerical optimization, with hands-on experience in CUDA/HIP, Triton, and performance profiling tools. He holds a master’s degree in Computer Science and Engineering from Santa Clara University.