Haishuo Kong#
Haishuo Kong is an MTS Software Development Engineer on the Training at Scale team, focused on building observability systems for large-scale training clusters. He has developed observability platforms for both CPU and GPU environments and has extensive experience supporting large-scale model training workloads.