Uma Kannikanti#
Uma is a Principal Member of Technical Staff at AMD, where he specializes in optimizing inference performance for large language models (LLMs) on AMD Instinct™ GPUs. As the inference lead on the MLPerf team, he leads workload optimization efforts and contributes to enhancing the inference software stack.