AMD

Senior Cluster Performance Engineer

Job Description

Posted on: 2026-01-12

Responsibilities

  • Collaborate with teams to enhance GPU cluster performance, focusing on RDMA throughput and latency.
  • Develop and execute benchmarking strategies to assess performance and identify bottlenecks.
  • Conduct scalability testing of GPU clusters under various workloads.
  • Use profiling tools to analyze performance bottlenecks and report the findings.
  • Implement optimization strategies for performance tuning.
  • Create detailed documentation of performance analysis and tuning efforts.
  • Stay current with developments in GPU architectures and parallel processing.

Job Requirements

  • Proven experience in optimizing GPU cluster performance.
  • Understanding of RDMA network drivers and GPU architectures.
  • Proficiency in scripting languages for automation and analysis.
  • Experience with performance analysis tools for GPU clusters.
  • Strong problem-solving and debugging skills.
  • Familiarity with cluster management tools and systems.
  • Bachelor's or Master's degree in computer science, or equivalent experience.